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The invention disclosed herein was made with Government 
support under Grant No. R01GM5514 7-01 from the National 
Institutes of Health of the United States Department of 
Health and Human Services. Accordingly, the U.S. 
Government has certain rights in this invention. 



BACKGROUND 



15 Throughout this application, various publications are 

referenced by author and date. Full citations for these 
publications may be found listed alphabetically at the 
end of the specification immediately preceding Sequence 
Listing and the claims. The disclosures of these 

2 0 publications in their entireties are hereby incorporated 

by reference into this application in order to more fully 
describe the state of the art as known to those skilled 

therein as of the date of the invention described and 

\ 

claimed^ herein. 

25 

Fas (APO-1/CD95) and its ligand have been identified as 
important signal -mediators of apoptosis (Itoh, et al . 
1991) The structural organization of Fas (APO-1/CD95) has 
suggested that it is a member of the tumor necrosis 

3 0 factor receptor superfamily, which also includes the p75 

nerve growth factor receptor (NGFR) (Johnson, et al . 
1986), the T-cell-activation marker CD27 (Camerini, et 
al . 1991), the Hodgkin-lymphoma-associated antigen CD30 

(Smith, et al . (1993), the human B cell antigen CD40 
35 (Stamenkovic, et al . 1989), and T cell antigen OX40 

(Mallett, et al , 1990). Genetic mutations of both Fas 
and its ligand have been associated with 
lymphoprolif erative and autoimmune disorders in mice 

(Watanabe-Fukunaga, et al . 1992; Takahashi, et al . 1994). 



40 
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Furthermore, alterations of Fas expression level have 
been thought to lead to the induction of apoptosis in 
T-cells infected with human immunodeficiency virus (HIV) 
(Westendorp, et al . 1995) . 

5 

Several Fas -interacting signal transducing molecules, 
such as Fas-associated phosphatase-1 (FAP-1) (Figure 1) 
(Sato, et al. 1995) FADD/MORTl/CAP- l/CAP-2 (Chinnaiyan, 
et al. 1995; Boldin, et al . 1995; Kischkel, et al . 1995) 

10 and RIP (Stanger, et al . 1995), have been identified 

using yeast two-hybrid and biochemical approaches. All 
but FAP-1 associate with the functional cell death domain 
of Fas and overexpression of FADD/MORTl or RIP induces 
apoptosis in cells transf ected with these proteins . In 

15 contrast, FAP-1 is the only protein that associates with 

the negative regulatory domain (C-terminal 15 amino 
acids) (Ito, et al . 1993) of Fas and that inhibits 
Fas-induced apoptosis. 

20 FAP-1 (PTPN13) has several alternatively-spliced forms 

that are identical to PTP-BAS/hPTPlE/PTPLl , (Maekawa, et 
al. 1994; Banville, et al . 1994; Saras, et al . 1994) and 
contains a membrane -binding region similar to those found 
in the cytoskeleton-associated proteins, ezrin, (Gould et 

25 al . 1989) radixin (Funayama et al . 1991) moesin (Lankes, 

et al . 1991), neurofibromatosis type II gene product 
(NFII) (Rouleau, et al . 1993), and protein 4.1 (Conboy, 
et al. 1991), as well as in the PTPases PTPHl (Yang, et 
al . 1991), PTP-MEG (Gu, et al . 1991), and PTPDl (Vogel, 

30 et al . 1993). FAP-1 intriguingly contains six GLGF 

(PDZ/DHR) repeats that are thought to mediate intra -and 
inter-molecular interactions among protein domains. The 
third GLGF repeat of FAP-1 was first identified as a 
domain showing the specific interaction with the 

35 C-terminus of Fas receptor (Sato, et al , 1995). This 

suggests that the GLGF domain may play an important role 
in targeting proteins to the submembranous cytoskeleton 
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and/or in regulating biochemical activity. GLGF repeats 
have been previously found in guanylate kinases, as well 
as in the rat post-synaptic density protein (PSD- 95) (Cho, 
et al . 1992), which is a homolog of the Drosophila tutnor 
5 suppressor protein, lethal- (1) -disc-large-1 [digr-1] 

(Woods, et al 1991; Kitamura, et al . 1994). These 
repeats may mediate homo- and hetero-dimerization, which 
could potentially influence PTPase activity, binding to 
Fas, and/or interactions of FAP-1 with other signal 
10 transduction proteins. Recently, it has also been 

reported that the different PDZ domains of proteins 
interact with the C-terminus of ion channels and other 
proteins (Figure 1) (TABLE 1) (Kornau, et al . 1995; Kim, 
et al. 1995; Matsumine, et al . 1996). 

15 

TABLE 1. Proteins that interact with PDZ domains. 



Protein 


C- tenninal 
sequence 


Associated 
protein 


Reference 


Fas (APO-1/CD95) 


SLV 


FAP-1 


2 


ISTMDA receptor 
ITR2 subunit 


SDV 


PSD95 


3 


Shaker-type K+ 
channel 


TDV 


PSD95 & DLG 


4 


APC 


TEV 


DLG 


5 
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SUMMARY OF THE INVENTION 

This invention provides a composition capable of 
inhibiting specific binding between a signal- transducing 
5 protein and a cytoplasmic protein containing the amino 

acid sequence (G/S/A/E) -L-G- (F/l/L) (Sequence I.D. No.: 
1) . Further, the cytoplasmic protein may contain the 
amino acid sequence (K/R/Q) -X^- (G/S/A/E) -L-G- (F/I/L) 
(Sequence I.D. No.: 2), wherein X represents any amino 

10 acid which is selected from the group comprising the 

twenty naturally occurring amino acids and n represents 
at least 2, but not more than 4. In a preferred 
embodiment, the amino acid sequence is SLGI (Sequence 
I.D, No.: 3). Further, the invention provides for a 

15 composition when the signal- transducing protein has at 

its carboxyl terminus the amino acid sequence (S/T) -X- 
(V/I/L) (Sequence I.D. No.: 4), wherein each - represents 
a peptide bond, each parenthesis encloses amino acids 
which are alternatives to one other, each slash within 

20 such parentheses separating the alternative amino acids, 

and the X represents any amino acid which is selected 
from the group comprising the twenty naturally occurring 
amino acids. 

25 This invention also provides for a method of identifying 

a compound capable of inhibiting specific binding between 
a signal-transducing protein and a cytoplasmic protein 
containing the amino acid sequence (G/S/A/E) -L-G- (F/I/L) . 
Further this invention provides for a method of 

30 identifying a compound capable of inhibiting specific 

binding between a signal- transducing protein having at 
its carboxyl teirminus the amino acid sequence (S/T) -X- 
(V/L/I) and a cytoplasmic protein. 

35 This invention also provides for a method inhibiting the 

proliferation of cancer cells, specifically, where the 
cancer cells are derived from organs comprising the 
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colon, liver, breast, ovary, testis, lung, stomach, 
spleen, kidney, prostate, uterus, skin, head, thymus and 
neck, or the cells are derived from either T-cells or B- 
cells . 

5 

This invention also provides for a method of treating 
cancer in a subject in an amount of the composition of 
effective to result in apoptosis of the cells, 
specifically, where the cancer cells are derived from 
10 organs comprising the thymus, colon, liver, breast, 

ovary, testis, lung, stomach, spleen, kidney, prostate, 
uterus, skin, head and neck, or the cells are derived 
from either T-cells or B-cells. 

15 This invention also provides for a method of inhibiting 

the proliferation of virally infected cells, specifically 
wherein the virally infected cells are infected with the 
Hepatitis B virus, Epstein-Barr virus, influenza virus. 
Papilloma virus. Adenovirus, Human T-cell lymphtropic 

20 virus, type 1 or HIV. 

This invention also provides a pharmaceutical composition 
comprising compositions capable of inhibiting specific 
binding between a signal- transducing protein and a 
25 cytoplasmic protein. 

This invention also provides a pharmaceutical composition 
comprising compounds identified to be capable of 
inhibiting specific binding between a signal -transducing 
3 0 protein and a cytoplasmic protein. 
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BRIEF DESCRIPTION OF THE FIGURES 



Figure 1. Diagram of Fas-associated phosphatase-1 
protein, showing the six GLGF (PDZ/DHR) domain repeats ; 
5 comparison of similar membrane binding sites with other 

proteins and proteins that contain GLGF (PDZ/DHR) 
repeats . 



Figures 2A, 2B, 2C and 2D. Mapping of the minimal region 
10 of the C-terminal of Fas required for the binding to 

FAP-1. Numbers at right show each independent clone 
(Figures 2C and 2D) . 

2A. Strategy for screening of a random peptide library 

by the yeast two-hybrid system. 
15 2B. Alignment of the C-terminal 15 amino acids of Fas 

between human (Sequence I.D. No.: 5), rat (Sequence 

I.D. No. : 6) , and mouse (Sequence I.D. No. : 7) . 
2C, The results of screening a semi -random peptide 

library. Top row indicates the amino acids which 
2 0 were fixed based on the homology between human and 

rat. Dash lines show unchanged amino acids. 
2D. The results of screening a random peptide library 

(Sequence I.D. 

Sequence I.D. 
2 5 Sequence I.D. 

Sequence I.D. 

Sequence I.D. 

respectively) . 



No. 


: 8, 


Sequence 


I 


.D. 


No, : 


9, 


No. : 


10, 


Sequence 


I . 


D. 


No. : 


11, 


No. : 


12, 


Sequence 


I . 


D. 


No. : 


13, 


No. : 


14, 


Sequence 


I . 


D. 


No. : 


15, 


No. : 


16, 


Sequence 


I . 


D. 


No. : 


17, 



3 0 Figures 3A, 3B and 3C. Inhibition assay of Fas/FAP-1 

binding In vitro. 

3A. Inhibition assay of Fas/FAP-1 binding using the 
C-terminal 15 amino acids of Fas. GST-Fas fusion 
protein (191-355) was used for in vitro binding 
35 assay (lane 1, 3-10) . GST-Fas fusion protein 

(191-320) (lane. 2) and 1 mM human PAMP (N-terminal 
2 0 amino acids of proadrenomedullin, M.W. 2460.9) 



# 
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10 



15 



20 



25 



30 



(lane 3) were used as negative controls. The 
concentrations of the C-terminal 15 amino acids 
added were 1 (lane 4), 3 (lane 5), 10 (lane 6), 30 
(lane 7), 100 (lane 8), 300 (lane 9), and 1000 /xM 
(lane 10) . 

3B. Inhibition assay of Fas/FAP-1 binding using the 
truncated peptides corresponding to the C-terminal 
15 amino acids of Fas. All synthetic peptides were 
acetylated for this inhibition assay (Sequence I.D. 
No,: 4, Sequence I.D. No.: 18, Sequence I.D. No.: 
19, Sequence I.D. No.: 20, Sequence I.D. No.: 21, 
Sequence I.D. No.: 22, Sequence I.D. No.: 23, 
respectively) , 

3C. Inhibitory effect of Fas/FAP-1 binding using the 
scanned tripeptides. 

Figures 4A, 4B, 4C and 4D. 

4A. Interaction of the C-terminal 3 amino acids of Fas 

with FAP-1 in yeast. 
4B. Interaction of the C-terminal 3 amino acids of Fas 

with FAP-1 in vitro. 
4C. Immuno-precipitation of native Fas with GST-FAP-1. 
4D. Inhibition of Fas/FAP-1 binding with Ac-SLV or Ac- 

SLY. 

Figures 5A, SB, 5C, 5D, 5E and 5F- Microinjection of 
Ac-SLV into the DLD-l cell line. Triangles identify the 
cells both that were could be microinjected with Ac-SLV 
and that showed condensed chromatin identified. On the 
other hand, only one cell of the area appeared apoptotic 
when microinjected with Ac -SLY. 

5A. Representative examples of the cells microinjected 
with Ac-SLV in the presence of 500 ng/ml CHll are 
shown in phase contrast . 

5B. Representative examples of the cells microinjected 



with AC- SLY in the presence of 50 0 ng/ml CHll are 
shown in phase contrast . 
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5C. Representative examples of the cells microin j ected 
with Ac-SLtV in the presence of 500 ng/ml CHll are 
shown stained with FITC. 
5D. Representative examples of the cells microin j ected 
5 with AC-SLY in the presence of 500 ng/ml CHll are 

shown stained with FITC, 
5E. Representative examples of the cells microin j ected 
with Ac-SLV in the presence of 500 ng/ml CHll are 
shown with fluorescent DNA staining with Hoechst 
10 33342. 

5F. Representative examples of the cells microinjected 
with AC-SLY in the presence of 500 ng/ml CHll are 
shown in fluorescent DNA staining with Hoechst 
33342 . 



15 



Figure 6. Quantitation of apoptosis in microinjected 
DLD-1 cells. 



Figures 7A, 7B, 7C, 7D, 7E, 7F, 7G, and 7H, 

20 7A. Amino acid sequence of human nerve growth factor 

receptor (Sequence I.D. No.: 24). 
7B • Amino acid sequence of human CD4 receptor (Sequence 
I.D. No. 25) . 

7C. The interaction of Fas-associated phosphatase-1 to 
25 the C-terminal of nerve growth factor receptor 

(NGFR) (p75) , 

7D. Amino acid sequence of human colorectal mutant 

cancer protein (Sequence I.D. No.: 26) . ^%6-'^pW^'^7 
*> 7E . Amino acid sequence of protein kinase C, alpha type. ^ 
3 0 7F. Amino acid sequencg^ of serotonin 2A receptor 



(Sequence I,D. No.: .3^) 

A 



7G. Amino acid sequence^^ of serotonin 2B receptor 

ijy (Sequence I.D. No.:^2^. 

7H. Amino acid sequence of adenomatosis polyposis coli 

1^ protein (Sequence I.D. No.:^^^. 



m 
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FigfVLTS 8- Representation of the structural 
characteristics of p75 NGFR (low-affinity nerve growth 
factor receptor) . 

5 Figure 9. Comparison of the C-terrninal ends of Fas and 

p75 NGFR. 

Figure 10. In vitro interaction of ^^S-labeled FAP-1 with 
various receptors expressed as GST fusion proteins . The 
10 indicated GST fusion proteins immobilized on glutathione - 

Sepharose beads were incubated with in vitro translated, 
^^S-labeled FAP-1 protein. After the beads were washed, 
retained FAP-1 protein was analyzed by SDS-PAGE and 
autoradiography . 

15 

Figures llA and IIB. In vitro interaction ^^S-labeled 
FAP-1 with GST-p75 deletion mutants. 

IIA. Schematic representation of the GST fusion 
proteins containing the cytoplasmic domains of 

20 p75 and p75 deletion mutants. Binding of FAP- 

1 to the GST fusion proteins with various p75 
deletion mutants is depicted at the right and 
is based on data from (llB) . 

IIB. Interaction of in vitro translated, ^^S- labeled 
25 FAP-1 protein with various GST fusion proteins 

immobilized on glutathione -Sepharose beads. 
After the beads were washed, retained FAP-1 
protein was analyzed by SDS-PAGE and 
autoradiography . 

30 

Figure 12. The association between LexA-C- terminal 
cytoplasmic region of p75NGFR and VP16-FAP-1. The 

indicated yeast strains were constructed by 
transformation and the growth of colonies was tested. 
35 +/- indicates the growth of colonies on his ' plate. 



wo 98/05347 



-10- 



PCT/US97/12677 



DETAILED DESCRIPTION OF THE INVENTION 



As used herein, amino acid residues are abbreviated as 
5 follows: A, Ala; C, Cys; D, Asp; Glu; F, Phe; G, Gly; 

H, His; I, lie; K, Lys ; L, Leu; M, Met; N, Asn; P, Pro; 
Q, Gin; R, Arg; S, Ser; T, Thr; V, Val; W, Trp; and Y, 
Tyr . 

10 In order to facilitate an understanding of the material 

which follows, certain frequently occurring methods 
and/or terms are best described in Sambrook, et al . , 
1989 . 

15 The present invention provides for a composition capable 

of inhibiting specific binding between a signal - 
transducing protein and a cytoplasmic protein containing 
the amino acid sequence (G/S/A/E) -L-G- (F/I/L) , wherein 
each - represents a peptide bond, each parenthesis 

2 0 encloses amino acids which are alternatives to one other, 

and each slash within such parentheses separating the 
alternative amino acids. Further, the cytoplasmic 
protein may contain the amino acid sequence (K/R/Q) -X^- 
(G/S/A/E) -L-G- (F/I/L) , wherein X represents any amino acid 

25 which is selected from the group comprising the twenty 

naturally occurring amino acids and n represents at least 
2, but not more than 4. Specifically, in a preferred 
embodiment, the cytoplasmic protein contains the amino 
acid sequence SLGI . 

30 

The amino acid sequence (K/R/Q) -X^- (G/S/A/E) -L-G- (F/I/L) 
is also well-known in the art as "GLGF (PDZ/DHR) amino 
acid domain." As used herein, "GLGF (PDZ/DHR) amino acid 
domain" means the amino acid sequence (K/R/Q) -X„- 
35 (G/S/A/E) -L-G- (F/I/L) . 



In a preferred embodiment, the signal -transducing protein 
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has at its carboxyl terminus the amino acid sequence 
(S/T) -X- (V/I/L) , wherein each - represents a peptide 
bond, each parenthesis encloses amino acids which are 
alternatives to one other, each slash within such 
5 parentheses separating the alternative amino acids, and 

the X represents any amino acid which is selected from 
the group comprising the twenty naturally occurring amino 
acids . 

10 The compositions of the subject invention may be, but not 

limited to, antibodies, inorganic compounds, organic 
compounds, peptides, peptidomimetic compounds, 

polypeptides or proteins, fragments or derivatives which 
share some or all properties, e.g. fusion proteins. The 

15 composition may be naturally occurring and obtained by 

purification, or may be non-naturally occurring and 
obtained by synthesis. 



Specifically, the composition may be a peptide containing 
20 the sequence (S/T) -X- (V/I/L) -COOH, wherein each 

represents a peptide bond, each parenthesis encloses 
amino acids which are alternatives to one other, each 
slash within such parentheses separating the alternative 
amino acids, the X represents any amino acid which is 
25 selected from the group comprising the twenty naturally 

occurring amino acids. In preferred embodiments, the 
peptide contains one of the following sequences: 
DSENSNFRNEIQSLV, RNEIQSIjV, NEIQSLV, EIQSLV, IQSLV, QSLV, 
SLV , I PPDSEDGNEEQSLV , DSEMYNFRSQLASW , IDLASEFLFLSNSFL , 
3 0 PPTCSQANSGRISTL, SDSNMNMNELSEV, QNFRTYIVSFV, RETIESTV, 

RGFISSLV, TIQSVI, ESLV, A further preferred embodiment 
would be an organic compound which has the sequence Ac- 
SLiV-COOH, wherein the Ac represents an acetyl and each - 
represents a peptide bond, 

35 



An example of the subject invention is provided infra . 
Acetylated peptides may be automatically synthesized on 
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an Advanced ChemTech ACT357 using previously published 
procedures by analogy. Wang resin was used for each run 
and N^'-Fmoc protection was used for all amino acids, and 
then 20% piperidine/DMF and coupling was completed using 
5 DIC/HOBt and subsequently HBTU/DIEA. After the last 

amino acid was coupled, the growing peptide on the resin 
was acetylated with ACjO/DlVIF . The acetylated peptide was 
purified by HPLC and characterized by FAB-MS and ^H-NNR. 

10 Further, one skilled in the art would know how to 

construct derivatives of the above-described synthetic 
peptides coupled to non-acetyl groups, such as amines. 

This invention also provides for a composition capable of 
15 inhibiting specific binding between a signal-transducing 

protein having at its carboxyl terminus the amino acid 
sequence (S/T) -X- (V/l/L) , wherein each - represents a 
peptide bond, each parenthesis encloses amino acids which 
are alternatives to one other, each slash within such 
20 parentheses separating the alternative amino acids, the 

X represents any amino acid which is selected from the 
group comprising the twenty naturally occurring amino 
acids, and a cytoplasmic protein. 

25 The compositions of the subject invention includes 

antibodies, inorganic compounds, organic compounds, 
peptides, pept idomimetic compounds, polypeptides or 
proteins, fragments or derivatives which share some or 
all properties, e.g. fusion proteins. 

30 

This invention also provides a method of identifying a 
compound capable of inhibiting specific binding between 
a signal-transducing protein and a cytoplasmic protein 
containing the amino acid sequence (G/S/A/E) -L-G- (F/I/L) , 
35 wherein each - represents a peptide bond, each 

parenthesis encloses amino acids which are alternatives 
to one other, each slash within such parentheses 
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separating the alternative amino acids, which comprises 
(a) contacting the cytoplasmic protein bound to the 
signal -transducing protein with a plurality of compounds 
under conditions permitting binding between a known 
5 compound previously shown to be able to displace the 

signal- transducing protein bound to the cytoplasmic 
protein and the bound cytoplasmic protein to form a 
complex; and (b) detecting the displaced signal- 
transducing protein or the complex formed in step (a) 
10 wherein the displacement indicates that the compound is 

capable of inhibiting specific binding between the 
signal- transducing protein and the cytoplasmic protein. 

The inhibition of the specific binding between the 
15 signal -transducing protein and the cytoplasmic protein 

may affect the transcription activity of a reporter gene . 

Further, in step (b) , the displaced cytoplasmic protein 
or the complex is detected by comparing the transcription 

20 activity of a reporter gene before and after the 

contacting with the compound in step (a) , where a change 
of the activity indicates that the specific binding 
between the signal -transducing protein and the 
cytoplasmic protein is inhibited and the signal - 

25 transducing protein is displaced. 

As used herein, the "transcription activity of a reporter 
gene" means that the expression level of the reporter 
gene will be altered from the level observed when the 

30 signal- transducing protein and the cytoplasmic protein 

are bound. One can also identify the compound by 
detecting other biological functions dependent on the 
binding between the signal- transducing protein and the 
cytoplasmic protein. Examples of reporter genes are 

3 5 numerous and well-known in the art, including, but not 

limited to, histidine resistant genes, ampicillin 
resistant genes, )8-galactosidase gene. 
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Further the cytoplasmic protein may be bound to a solid 
support. Also the compound may be bound to a solid 
support and comprises an antibody, an inorganic compound, 
an organic compound, a peptide, a peptidomimetic 
5 compound, a polypeptide or a protein. 



An example of the method is provided infra . One can 
identify a compound capable of inhibiting specific 
binding between the signal- transducing protein and the 

10 cytoplasmic protein using direct methods of detection 

such as immuno-precipitation of the cytoplasmic protein 
and the compound bound to a detectable marker. Further, 
one could use indirect methods of detection that would 
detect the increase or decrease in levels of gene 

15 expression. As discussed infra, one could construct 

synthetic peptides fused to a LexA DNA binding domain. 
These constructs would be transformed into the L4 0- strain 
with an appropriate cell line having an appropriate 
reporter gene. One could then detect whether inhibition 

2 0 had occurred by detecting the levels of expression of the 

reporter gene. In order to detect the expression levels 
of the reporter gene, one skilled in the art could employ 
a variety of well-known methods, e.g. two-hybrid systems 
in yeast, mammals or other cells. 

25 

Further, the contacting of step (a) may be in vitro , in 
vivo, and specifically in an appropriate cell, e.g. yeast 
cell or mammalian cell. Examples of mammalian cells 
include, but not limited to, the mouse fibroblast cell 
30 NIH 3T3, CHO cells, HeLa cells, Ltk' cells, Cos cells, 

etc 



Other suitable cells include, but are not limited to, 
prokaryotic or eukaryotic cells, e.g. bacterial cells 
35 (including gram positive cells) , fungal cells, insect 

cells, and other animals cells. 
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Further, the signal- transducing protein may be a cell 
surface receptor, signal transducer protein, or a tumor 
suppressor protein. Specifically, the cell surface 
protein is the Fas receptor and may be expressed in cells 
5 derived from organs including, but not limited to, 

thymus, liver, kidney, colon, ovary, breast, testis, 
spleen, lung, stomach, prostate, uterus, skin, head, and 
neck, or expressed in cells comprising T-cells and B- 
cells. In a preferred embodiment, the T-cells are Jurkat 
10 T-cells, 

Further, the cell -surface receptor may be a CD4 receptor, 
p75 receptor, serotonin 2A receptor, or serotonin 2B 
receptor . 

15 

Further, the signal transducer protein may be Protein 
Kinase-C-or-type . 

Further, the tumor suppressor protein may be a 

2 0 adenomatosis polyposis coli tumor suppressor protein or 

colorectal mutant cancer protein. 

Further, the cytoplasmic protein contains the amino acid 
sequence SLGI , specifically Fas-associated phosphatase-1 . 

25 

This invention also provides a method of identifying a 
compound capable of inhibiting specific binding between 
a signal- transducing protein having at its carboxyl 
terminus the amino acid sequence (S/T) -X- (V/I/L) , wherein 

3 0 each - represents a peptide bond, each parenthesis 

encloses amino acids which are alternatives to one other, 
each slash within such parentheses separating the 
alternative amino acids, the X represents any amino acid 
which is selected from the group comprising the twenty 
35 naturally occurring amino acids, and a cytoplasmic 

protein which comprises (a) contacting the signal- 
transducing protein bound to the cytoplasmic protein with 
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a plurality of compounds under conditions permitting 
binding between a knovm compound previously shown to be 
able to displace the cytoplasmic protein bound to the 
signal -transducing protein and bound signal- transducing 
5 protein to form a complex; and (b) detecting the 

displaced cytoplasmic protein or the complex of step (a) , 
wherein the displacement indicates that the compound is 
capable of inhibiting specific binding between the 
signal- transducing protein and the cytoplasmic protein. 

10 The inhibition of the specific binding between the 

signal -transducing protein and the cytoplasmic protein 
affects the transcription activity of a reporter gene. 
Further, in step (b) , the displaced signal- transducing 
protein or the complex is detected by comparing the 

15 transcription activity of a reporter gene before and 

after the contacting with the compound in step (a) , where 
a change of the activity indicat es that the specific 
binding between the signal- transducing protein and the 
cytoplasmic protein is inhibited and the cytoplasmic 

20 protein is displaced. 



Further, in step (b) , the displaced cytoplasmic protein 
or the complex is detected by comparing the transcription 
activity of a reporter gene before and after the 
25 contacting with the compound in step (a) , where a change 

of the activity indicates that the specific binding 
between the signal- transducing protein and the 
cytoplasmic protein is inhibited and the signal- 
transducing protein is displaced. 

30 

As used herein, the "transcription activity of a reporter 
gene" means that the expression level of the reporter 
gene will be altered from the level observed when the 
signal-transducing protein and the cytoplasmic protein 
35 are bound. One can also identify the compound by 

detecting other biological functions dependent on the 
binding between the signal- transducing protein and the 
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cytoplasmic protein. Examples of reporter genes are 
numerous and well-known in the art^ including, but not 
limited to, histidine resistant genes, ampicillin 
resistant genes, )8-galactosidase gene. 

5 

Further, the cytoplasmic protein may be bound to a solid 
support or the compound may be bound to a solid support, 
comprises an antibody, an inorganic compound, an organic 
compound, a peptide, a peptidomimetic compound, a 
10 polypeptide or a protein. 

An example of the method is provided infra. One could 
identify a compound capable of inhibiting specific 
binding between the signal-transducing protein and the 

15 cytoplasmic protein using direct "methods of detection 

such as immuno-precipitation of the cytoplasmic protein 
and the compound bound with a detectable marker. 
Further, one could use indirect methods of detection that 
would detect the increase or decrease in levels of gene 

20 expression. As discussed infra, one could construct 

synthetic peptides fused to a LexA DNA binding domain. 
These constructs would be transformed into L40 -strain 
with an appropriate cell line having a reporter gene. 
One could then detect whether inhibition had occurred by 

25 detecting the levels of the reporter gene. Different 

methods are also well known in the art, such as employing 
a yeast two-hybrid system to detect the expression of a 
reporter gene . 

30 Further the contacting of step (a) can be in vitro or in 

vivo, specifically in a yeast cell or a mammalian cell. 
Examples of mammalian cells include, but not limited to, 
the mouse fibroblast cell NIH 3T3 , CHO cells, HeLa cells, 
Ltk" cells, Cos cells, etc. 



35 



Other suitable cells include, but are not limited to, 
prokaryotic or eukaryotic cells, e.g. bacterial cells 



m 
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(including gram positive cells) , fungal cells, insect 
cells, and other animals cells. 



Further, the signal -transducing protein is a cell surface 
5 receptor, signal transducer protein, or a tumor 

suppressor protein. Specifically, the cell surface 
protein is the Fas receptor and is expressed in cells 
derived from organs comprising thymus, liver, kidney, 
colon, ovary, breast, testis, spleen, stomach, prostate, 
10 uterus, skin, head and neck, or expressed in cells 

comprising T- cells and B-cells . In a preferred 

embodiment, the T-cells are Jurkat T-cells. 

Further, the cell -surface receptor may be a CD4 receptor, 
15 p75 receptor, serotonin 2A receptor, or serotonin 2B 

receptor . 

Further, the signal transducer protein may be Protein 
Kinase-C-Qf- type . 

20 

Further, the tumor suppressor protein may be a 
adenomatosis polyposis coli tumor suppressor protein or 
colorectal mutant cancer protein. 

25 Further, the cytoplasmic protein contains the amino acid 

sequence SLGI, specifically Fas-associated phosphatase- 
1 . 

This invention also provides a method of inhibiting the 
3 0 proliferation of cancer cells comprising the above- 

described composition, specifically, wherein the cancer 
cells are derived from organs including, but not limited 
to, thymus, liver, kidney, colon, ovary, breast, testis, 
spleen, stomach, prostate, uterus, skin, head and neck, 
35 or wherein the cancer cells are derived from cells 

comprising T-cells and B-cells. 
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This invention also provides a method of inhibiting the 
proliferation of cancer cells comprising the compound 
identified by the above -described method, wherein the 
cancer cells are derived from organs including, but not 
5 limited to, thymus, liver, kidney, colon, ovary, breast, 

testis, spleen, stomach, prostate, uterus, skin, head and 
neck, or wherein the cancer cells are derived from cells 
comprising T-cells and B-cells. 

The invention also provides a method of treating cancer 
in a subject which comprises introducing to the subject's 
cancerous cells an amount of the above -described 
composition effective to result in apoptosis of the 
cells, wherein the cancer cells are derived from organs 
including, but not limited to, thymus, liver, kidney, 
colon, ovary, breast, testis, spleen, stomach, prostate, 
uterus, skin, head and neck, or wherein the cancer cells 
are derived from cells comprising T-cells and B-cells. 

As used herein "apoptosis" means programmed cell death of 
the cell. The mechanisms and effects of programmed cell 
death differs from cell lysis. Some observable effects 
of apoptosis are : DNA fragmentation and disintegration 
into small membrane -bound fragments called apoptotic 
bodies . 

Means of detecting whether the composition has been 
effective to result in apoptosis of the cells are well- 
known in the art. One means is by assessing the 
3 0 morphological change of chromatin using either phase 

contrast or fluorescence microscopy. 

The invention also provides for a method of inhibiting 
the proliferation of virally infected cells comprising 
3 5 the above -described composition or the compound 

identified by the above -described, wherein the virally 
infected cells comprise Hepatitis B virus, Epstein-Barr 



10 
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virus, influenza virus. Papilloma virus, Adeno virus. 
Human T-cell lymphtropic virus, type 1 or HIV. 

The invention also provides a method of treating a 
5 virally-inf ected subject which comprises introducing to 

the subject's virally- infected cells the above -described 
composition effective to result in apoptosis of the cells 
or the compound identified by the above -described method 
of claim 27 effective to result in apoptosis of the 
10 cells, wherein the virally infected cells comprise the 

Hepatitis B virus, Epstein-Barr virus, influenza virus. 
Papilloma virus, Adeno virus. Human T-cell lymphtropic 
virus, type 1 or HIV. 

15 Means of detecting whether the composition has been 

effective to result in apoptosis of the cells are well- 
known in the art. One means is by assessing the 
morphological change of chromatin using either phase 
contrast or fluorescence microscopy. 



20 



25 



30 



This invention also provides for a pharmaceutical 
composition comprising the above -described composition of 
in an effective amount and a pharmaceutically acceptable 
carrier . 

This invention also provides for a pharmaceutical 
composition comprising the compound identified by the 
above-described method of in an effective amount and a 
pharmaceutically acceptable carrier. 



This invention further provides a composition capable of 
specifically binding a signal- transducing protein having 
at its carboxyl terminus the amino acid sequence (S/T) -X- 
(V/L/I) , wherein each - represents a peptide bond, each 
35 parenthesis encloses amino acids which are alternatives 

to one other, each slash within such parentheses 
separating the alternative amino acids, and the X 



• 
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represents any amino acid which is selected from the 
group comprising the twenty naturally occurring amino 
acids. The composition may contain the amino acid 
sequence (G/S/A/E) -L-G- (F/I/L) , wherein each - represents 
5 a peptide bond, each parenthesis encloses amino acids 

which are alternatives to one other, and each slash 
within such parentheses separating the alternative amino 
acids. In a preferred embodiment, the composition 
contains the amino acid sequence (K/R/Q) -X^- (G/S/A/E) -L-G- 
10 (F/I/L) . wherein X represents any amino acid which is 

selected from the group comprising the twenty naturally 
occurring amino acids and n represents at least 2, but 
not more than 4. In another preferred embodiment, the 
composition contains the amino acid sequence SLGI . 

15 

This invention further provides a method for identifying 
compounds capable of binding to a signal- transducing 
protein having at its carboxyl terminus the amino acid 
sequence (S/T) -X- (V/L/I) , wherein each - represents a 

2 0 peptide bond, each parenthesis encloses amino acids which 

are alternatives to one other, each slash within such 
parentheses separating the alternative amino acids, the 
X represents any amino acid which is selected from the 
group comprising the twenty naturally occurring amino 
25 acids, which comprises (a) contacting the signal - 

transducing protein with a plurality of compounds under 
conditions permitting binding between a known compound 
previously shown to be able to bind to the signal - 
transducing protein to form a complex; and (b) detecting 

3 0 the complex formed in step (a) so as to identify a 

compound capable of binding to the signal -transducing 
protein. Specifically, the identified compound contains 
the amino acid sequence (G/S/A/E) -L-G- (F/I/L) . In a 
further preferred embodiment, the identified compound 
35 contains the amino acid sequence SLGI . 



Further, in the above -described method, the signal- 
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transducing protein may be bound to a solid support . 
Also, the compound may be bound to a solid support, and 
may comprise an antibody, an inorganic compound, an 
organic compound, a peptide, a peptidomimetic compound, 
5 a polypeptide or a protein. 



Further, the signal -transducing protein may be a cell- 
surface receptor or a signal transducer. Specifically, 
the signal -transducing protein may be the Fas receptor, 
10 CD4 receptor, p75 receptor, serotonin 2A receptor, 

serotonin 2B receptor, or protein kinase-C-of- type . 

This invention also provides a method of restoring 
negative regulation of apoptosis in a cell comprising the 
15 above-described composition or a compound identified by 

the above -described method. 



As used herein "restoring negative regulation of 
apoptosis" means enabling the cell from proceeding onto 
2 0 programmed cell death. 

For example, cells that have functional Fas receptors and 
Fas -associated phosphatase 1 do not proceed onto 
programmed cell death or apoptosis due to the negative 

25 regulation of Fas by the phosphatase. However, if Fas- 

associated phosphatase 1 is unable to bind to the 
carboxyl terminus of the Fas receptor ( (S/T) -X- (V/L/I) 
region) , e.g. mutation or deletion of at - least one of 
the amino acids in the amino acid sequence (G/S/A/E) -L-G- 

30 (F/I/L) , the cell will proceed to apoptosis. By 

introducing a compound capable of binding to the carboxyl 
terminus of the Fas receptor, one could mimic the effects 
of a fiinctional phosphatase and thus restore the negative 
regulation of apoptosis. 

35 



This invention also provides a method of preventing 
apoptosis in a cell comprising the above-described 
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composition or a compound identified by the above - 
described method. 



This invention also provides a means of treating 
5 pathogenic conditions caused by apoptosis of relevant 

cells comprising the above-described composition or the 
compound identified by the above -described method. 

This invention is illustrated in the Experimental Details 
10 section which follows. These sections are set forth to 

aid in an understanding of the invention but are not 
intended to, and should not be constmed to, limit in any 
way the invention as set forth in the claims which follow 
thereafter . 
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FIRST SERIES OF EXPERIMENTS 
Experimental Details 
5 Methods and Materials 

1. Screening a semi -random and random peptide library. 

To create numerous mutations in a restricted DNA 
10 sequence, PGR mutagenesis with degenerate 

oligonucleotides was employed according to a protocol 
described elsewhe re (Hill, et al . 1987) . Based on the 
homology between human and rat, two palindromic sequences 
were designed for construction of semi -random library. 
15 The two primers used were 

5' - CG GAATTC NNNNNNNNNAACAGCNNNISn^ 

NTG AGGATCC TCA- 3 ' (Seq. I.D. No.: ^ 3sQ) and 

5' -CGGAMlTCGACTCAGAANNNNNNAACTTCAGANNNNim 



CTGA GGATCC TCA-3 ' (Seq. I.D. No . : . Briefly, the two 

20 primers (each 200 pmol) , purified by HPLC, were annealed 

at 70 °C for 5 minutes and cooled at 23 °C for 60 minutes. 
A Klenow fragment (5 U) was used for filling in with a 
dNTP mix (final concentration, 1 mM per each dNTP) at 
23°C for 60 minutes. The reaction was stopped with 1 ^1 

25 of 0.5 M EDTA and the DNA was purified with ethanol 

precipitation. The resulting double -stranded DNA was 
digested with EcoRI and BamHI and re-purified by 
electrophoresis on non-denaturing polyacrylamide gels. 
The double-strand oligonucleotides were then ligated into 

3 0 the EcoRI -BamHI sites of the pBTMlie plasmid. The 

ligation mixtures were electroporated into the E. coli 
XLl-Blue MRF' (Stratagene) for the plasmid library. The 
large scale transformation was carried out as previously 
reported. The plasmid library was transformed into 

35 L40-strain cells (MATa, trpl, leu2, his3 , ade2 , 

LYS2: (lexAop)^-HIS3 , URA3 : : (lexAopf -lacZ) carrying the 
plasmid pVP16-31 containing a FAP-1 cDNA (Sato, et al . 
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19 95) . Clones that formed on hist idine-def icient medium 
(His*) were transferred to plates containing 4 0 /xg/ml 
X-gal to test for a blue reaction product (S-gal*) in 
plate and filter assays. The clones selected by His* and 
6-gal* assay were tested for further analysis. The 
palindromic oligonucleotide. 



5^ -C GQAATTC - (NNN) . . -TG AGGATCC TCA- 3 ^ (Seq. I.D. No. 5^), 



was used for the construction of the random peptide 
library. 



Peptides were automatically synthesized on an Advanced 
ChemTech ACT3 5 7 by analogy to published procedures 
(Schnorrenberg and Gerhardt, 1989). Wang resin (0.2-0.3 
mmole scale) was used for each run and N^-Fmoc protection 
was employed for all amino acids. Deprotection was 
achieved by treatment with 20% piperidine/DMF and 
coupling was completed using DIC/HOBt and subsequent 
HBTU/DIEA. After the last amino acid was coupled, the 
growing peptide on the resin was acetylated with AC2O/DMF . 
The peptide was cleaved from the resin with concomitant 
removal of all protecting groups by treating with TFA. 
The acetylated peptide was purified by HPLC and 
characterized by FAB-MS and ^H-NMR. 

3. Inhibition asssay of Fas/FAP-1 binding using the C- 
terminal 15 amino acids of Fas. 

HFAP-10 cDNA (Sato, et al . 1995) subcloned into the 
Bluescript vector pSK-II (Stratagene) was in 
vitro- translated from an internal methionine codon in the 
presence of ^^S-L-methionine using a coupled in vitro 
transcription/translation system (Promega, TNT lysate) 
and T7 RNA polymerase. The resulting ^^S- labeled protein 
was incubated with GST-Fas fusion proteins that had been 
immobilized on GST-Sepharose 4B affinity beads 
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(Pharmacia) in a buffer containing 150 mM NaCl, 50 wM 
Tris [pH 8.0], 5 mM DTT, 2 mM EDTA, 0.1 % NP-40, 1 mM 
PMSF, 50 /ig/ml leupeptin, 1 mM Benzamidine , and 7 /xg/ml 
pepstatin for 16 hours at 4 *C. After washing vigorously 
4 times in the same buffer, associated proteins were 
recovered with the glutathione -Sepharose beads by 
centrif ugation, eluted into boiling Laemmli buffer, and 
analyzed by SDS-PAGE and f luorography . 

4 . Inhibition assay of terminal 15 amino acids of Fas 
and inhibitory effect of Fas/FAP-1 binding using 
diverse tripeptides . 

In vitro-translated [^^S]HFAP-1 was purified with a NAP- 5 
column (Pharmacia) and incubated with 3 /xM of GST- fusion 
proteins for 16 hours at 4*C. After washing 4 times in 
the binding buffer, radioactivity incorporation was 
determined in a b counter. The percentage of binding 
inhibition was calculated as follows: percent inhibition 
[radioactivity incorporation using GST-Fas (191-335) 
with peptides - radioactivity incorporation using GST- Fas 
(191-320) with peptides] / [radioactivity incorporation 
using GST-Fas (191-335) without peptides - radioactivity 
incorporation using GST-Fas (191-320) without peptides] . 
n=3 . 

5. Interaction of the C-terminal 3 amino acids of Fas 
with FAP-1 in yeast and in vitro. 

The bait plasmids, pBTM116 (LexA) -SLV, -PLV, -SLY, and 
-SLA, were constructed and transformed into L40-strain 
with pVP16 -FAP-1 or -ras. Six independent clones from 
each transf ormants were picked up for the analysis of 
growth on histidine-def icient medium. GST-Fas, -SLV, and 
PLV were purified with GST-Sepharose 4B affinity beads 
(Pharmacia) . The methods for in vitro binding are 
described above . 
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6. Immuno-precipitation of native Fas with GST-FAP-1 
and inhibition of Fas/FAP-1 binding with Ac-SLV. 

GST-fusion proteins with or without FAP-1 were incubated 
5 with cell extracts from Jurkat T-cells expressing Fas. 

The bound Fas was detected by Western analysis using 
anti-Fas monoclonal antibody (F22120, Transduction 
Laboratories) . The tripeptides, Ac-SLV and Ac-SLY were 
used for the inhibition assay of Fas/FAP-1 binding. 

10 

7. Microinjection of Ac-SLV into the DLD-1 cell line. 
DLD-1 human colon cancer cells were cultured in RPMI 164 0 
medium containing 10% FCS . For microinjection, cells 
were plated on CELLocate (Eppendorf) at 1 X 10^ cells/2 ml 

15 in a 35 mm plastic culture dish and grown for 1 day. Just 

before microinjection. Fas monoclonal antibodies CHll 
(MBL International) was added at the concentration of 500 
ng/ml. All microinjection experiments were performed 
using an automatic microinjection system (Eppendorf 

20 trans jector 5246, micro-manipulator 5171 and Femtotips) 

(Pantel, et al . 1995). Synthetic tripeptides were 
suspended in 0.1% (w/v) FITC-Dextran (Sigma) /K- PBS at the 
concentration of 10 0 mM. The samples were microinj ected 
into the cytoplasmic region of DLD-1 cells. Sixteen to 

25 20 hours postinjection, the cells were washed with PBS 

and stained with 10 fig/ml Hoechst 33342 in PBS. After 
incubation at 3 7°C for 3 0 minutes, the cells were 
photographed and the cells showing condensed chromatin 
were counted as apoptotic. 

30 

8. Quantitation of apoptosis in microinj ected DLD-1 
cells. 

For each experiment, 25-100 cells were microinj ected. 
35 Apoptosis of microinjected cells was determined by 

assessing morphological changes of chromatin using phase 
contrast and fluorescence microscopy (Wang, et al . , 1995; 
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McGahon, et al . , 1995). The data are means +/- S.D, for 
two or three independent determinations. 



Discussion 

5 

In order to identify the minimal peptide stretch in the 
C-terminal region of the Fas receptor necessary for FAP-1 
binding, an in vitro inhibition assay of Fas/FAP-1 
binding was used using a series of synthetic peptides as 

10 well as yeast two-hybrid system peptide libraries (Figure 

2A) . First, semi-random libraries (based on the homology 
between human and rat Fas) (Figures 2B and 2C) of 15 
amino acids fused to a LexA DNA binding domain were 
constructed and co- transformed into yeast strain L4 0 with 

15 pVP16-31 (Sato, et al . 1995) that was originally isolated 

as FAP-1. After the selection of 200 His'' colonies from an 
initial screen of 5.0 X 10^ (Johnson, et al . 1986) 
transf ormants , 100 colonies that were i8-galactosidase 
positive were picked for further analysis. Sequence 

2 0 analysis of the library plasmids encoding the C-terminal 

15 amino acids revealed that all of the C-termini were 
either valine, leucine or isoleucine residues. Second, 
a random library of 4-15 amino acids fused to a LexA DNA 
binding domain was constructed and screened according to 
25 this strategy (Figure 2D) . Surprisingly, all of the third 

amino acid residues from the C-termini were serine, and 
the results of C-terminal amino acid analyses were 
identical to the screening of the semi -random cDNA 
libraries. No other significant amino acid sequences were 

3 0 found in these library screenings, suggesting that the 

motifs of the last three amino acids (tS-X-V/L/l) are 
very important for the association with the third PDZ 
domain of FAP-1 and play a crucial role in 
protein-protein interaction as well as for the regulation 
35 of Fas-induced apoptosis. To further confirm whether the 

last three amino acids are necessary and sufficient for 
Fas/FAP-1 binding, plasmids of the LexA-SLV, -PLV, -PLY, 
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-SLY, and -SLA fusion proteins were constructed and 
CO- transformed into yeast with pVP16-FAP-l. The results 
showed that only LexA-SLV associated with FAP-1, whereas 
LexA-PLV, -PLY, -SLY, and -SLA did not (Figure 4 A) . In 
5 vitro binding studies using various GST-tripeptide 

fusions and in vi tro- translated FAP-1 were consistent 
with these results (Figure 4B) . 

In addition to yeast two- hybrid approaches, in vitro 

10 inhibition assay of Fas/FAP-1 binding was also used. 

First, a synthetic peptide of the C- terminal 15 amino 
acids was tested whether it could inhibit the binding of 
Fas and FAP-1 in vitro (Figure 3A) . The binding of in 
vi tro- translated FAP-1 to GST-Fas was dramatically 

15 reduced and dependent on the concentration of the 

synthetic 15 amino acids of Fas. In contrast with these 
results, human PAMP peptide (Kitamura, et al . 1994) as a 
negative control had no effect on Fas/FAP-1 binding 
activity under the same biochemical conditions. Second, 

20 the effect of truncated C-terminal synthetic peptides of 

Fas on Fas/FAP-1 binding in vitro was examined. As shown 
in Figure 3B, only the three C-terminal amino acids 
(Ac-SLV) were sufficient to obtain the same level of 
inhibitory effect on the binding of FAP-1 to Fas as 

25 achieved with the 4-15 synthetic peptides. Furthermore, 

Fas/FAP-1 binding was extensively investigated using the 
scanned tripeptides to determine the critical amino acids 
residues required for inhibition (Figure 3C) . The 
results revealed that" the third amino acids residues from 

3 0 the C- terminus, and the C-terminal amino acids having the 

strongest inhibitory effect were either serine or 
threonine; and either valine, leucine, or isoleucine, 
respectively. However, there were no differences among 
the second amino acid residues from the C- terminus with 

35 respect to their inhibitory effect on Fas/FAP-1 binding. 

These results were consistent with those of the yeast 
two-hybrid system (Figures 2C and 2D) . Therefore, it was 
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concluded that the C- terminal three amino acids (SLV) are 
critical determinants of Fas binding to the third PDZ 
domain of FAP-1 protein. 



5 To further substantiate that the PDZ domain interacts 

with tS/T-X-V/L/I under more native conditions, GST-fused 
FAP-1 proteins were tested for their ability to interact 
with Fas expressed in Jurkat T-cells. The results 
revealed that the tripeptide Ac-SLV, but not Ac-SLY, 

10 abolished in a dose -dependent manner the binding activity 

of FAP-1 to Fas proteins extracted from Jurkat T-cells 
(Figures 4C and 4D) . This suggests that the C-terminal 
amino acids tSLV are the minimum binding site for FAP-1, 
and that the amino acids serine and valine are critical 

15 for this physical association. 

To next examine the hypothesis that the physiological 
association between the C-terminal three amino acids of 
Fas and the third PDZ domain of FAP-1 is necessary for 

20 the in vivo function of FAP-1 as a negative regulator of 

Fas -mediated signal transduct ion , a microinj ect ion 
experiment was employed with synthetic tripeptides in a 
colon cancer cell line, DLD-1, which expresses both Fas 
and FAP-1, and is resistant to Fas- induced apoptosis. 

25 The experiments involved the direct microinjection of the 

synthetic tripeptides into the cytoplasmic regions of 
single cells and the monitoring of the physiological 
response to Fas-induced apoptosis in vivo. The results 
showed that microinjection of Ac-SLV into DLD-1 cells 

3 0 dramatically induced apoptosis in the presence of 

Fas -monoclonal antibodies (CHll, 500 ng/ml) (Figures 5A, 
5E and Figure 6) , but that microinjection of Ac-SLY and 
PBS/K did not (Figures 5B, 5F and Figure 6) . These 
results strongly support the hypothesis that the physical 

35 association of FAP-1 with the C- terminus of Fas is 

essential for protecting cells from Fas-induced 
apoptosis . 
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In summary, it was found that the C-terminal SLV of Fas 
is alone necessary and sufficient for binding to the 
third PDZ domain of FAP-1. Secondly, it is proposed that 
the new consensus motif of tS/T-X-V/L/I for such binding 
to the PDZ doma in, instead of tS/T-X-V. It is therefore 
possible that FAP-1 plays important roles for the 
modulation of signal transduction pathways in addition to 
its physical interaction with Fas. Thirdly, it is 
demonstrated that the targeted induction of Fas-mediated 
apoptosis in colon cancer cells by direct microinjection 
of the tripeptide Ac -SLV. Further investigations 
including the identification of a substrate (s) of FAP-1 
and structure-function analysis will provide insight to 
the potential therapeutic applications of Fas/FAP-1 
interaction in cancer as well as provide a better 
understanding of the inhibitory effect of FAP-1 on 
Fas-mediated signal transduction. 
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SECOND SERIES OF EXPERIMENTS 



FAP-1 was originally identified as a membrane -associated 
protein tyrosine phosphatase which binds to the C- 
5 terminus of Fas, and possesses six PDZ domains (also 

known as DHR domain or GLGF repeat) . PDZ domain has 
recently been shown as a novel module for specific 
protein-protein interaction, and it appears to be 
important in the assembly of membrane proteins and also 

10 in linking signaling molecules in a multiprotein complex. 

In recent comprehensive studies, it was found that the 
third PDZ domain of FAP-1 specifically recognized the 
sequence motif t (S/T) -X-V and interacts with the C- 
terminal three amino acids SLV of Fas (Fig. 9) . In order 

15 to investigate the possibility that FAP-1 also interacts 

with the C- terminal region of p75NGFR (Fig. 8) , an in 
vitro binding assay, was performed as well as, a yeast 
two-hybrid analysis by using a series of deletion mutants 
of p75NGFR. The results revealed that the C- terminal 

2 0 cytoplasmic region of p75NGFR, which is highly conserved 

among all species, interacts with FAP-1 (Fig, 10). 
Furthermore, the C- terminal three amino acids SPV of 
p75NGFR were necessary and sufficient for the interaction 
with the third PDZ domain of FAP-1 (Fig. IIA and IIB) . 
25 Since FAP-1 expression was found highest in fetal brain, 

these findings imply that interaction of FAP-1 with 
p75NGFR plays an important role for signal transduction 
pathway via p75NGFR in neuronal cells as well as in the 
formation of the initial signal- transducing complex for 

3 0 p75NGFR. 
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SEQUENCE LISTING 

(1) GENERAL INFORMATION: 

5 

(i) APPLICANT: Takaaki Sato and Junn Yanagisawa 

(ii) TITLE OF INVENTION: COMPOUNDS THAT INHIBIT THE 

INTERACTION BETWEEN SIGNAL- 
10 TRANSDUCING PROTEINS AND THE GLGF 

(PDZ/DHR) DOMAIN AND USES THEREOF 

(iii) NUMBER OF SEQUENCES: 3 3 

15 (iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Cooper & Dunham LLP 

(B) STREET: 1185 Avenue of the Americas 

(C) CITY: New York 

(D) STATE: New York 

2 0 (E) COUNTRY: U.S.A. 

(F) ZIP: 10036 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 
25 (B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS /MS-DOS 

(D) SOFTWARE: Patentin Release #1.0, Version #1.3 0 

(vi) CURRENT APPLICATION DATA: 

3 0 (A) APPLICATION NUMBER: Not Yet Known 

(B) FILING DATE: 18-JUL-1997 

(C) CLASSIFICATION: 

(viii) ATTORNEY /AGENT INFORMATION: 
3 5 (A) NAME: White, John P 

(B) REGISTRATION NUMBER: 28,678 

(C) REFERENCE /DOCKET NUMBER: 0575/48 962-A- PCT/JPW/JKM 

(ix) TELECOMMUNICATION INFORMATION: 
40 (A) TELEPHONE: (212) 278-0400 

(B) TELEFAX: (212) 391-0525 

(2) INFORMATION FOR SEQ ID NO : 1 : 

45 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 

50 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

55 (iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 

Gly/Ser/Ala/Glu Leu Gly Phe/Ile/Leu 
60 1 



(2) INFORMATION FOR SEQ ID NO : 2 : 



65 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 



# 




wo 98/05347 PCT/US97/12677 

-37- 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

5 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

10 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 

Lys/Arg/Gln Xaa (n) Gly/Ser/Ala/Glu Leu Gly Phe/Ile/Leu 
1 5 



15 



25 



35 



45 



55 



65 



(2) INFORMATION FOR SEQ ID NO : 3 : 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 4 amino acids 
2 0 (B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

( i V ) ANTI - SENSE : NO 

3 0 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 : 



Ser Leu Gly lie 

1 



(2) INFORMATION FOR SEQ ID NO : 4 : 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 6 amino acids 
4 0 (B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

50 (xi) SEQtTENCE DESCRIPTION: SEQ ID NO : 4 : 



Ser/Thr Xaa Val/Ile/Leu 
1 



(2) INFORMATION FOR SEQ ID NO : 5 : 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 
60 (B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5 : 
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Asp Ser Glu Asn Ser Asn Phe Arg Asn Glu lie Gin Ser Leu Val 
15 10 15 

5 (2) INFORMATION FOR SEQ ID NO : 6 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 

10 (C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6 : 

Ser lie Ser Asn Ser Arg Asn Glu Asn Glu Gly Gin Ser Leu Glu 
15 10 15 



20 



30 



35 



(2) INFORMATION FOR SEQ ID NO : 7 ; 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 
25 (B) TYPE: amino acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : 1 inear 



(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7 : 

Ser Thr Pro Asp Thr Gly Asn Glu Asn Glu Gly Gin Cys Leu Glu 
15 10 15 



(2) INFORMATION FOR SEQ ID NO : 8 : 

(i) SEQUENCE CHARACTERISTICS: 
4 0 (A) LENGTH: 4 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

4 5 (ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8 : 

Glu Ser Leu Val 
50 1 

(2) INFORMATION FOR SEQ ID NO : 9 : 

55 (i) SEQUENCE CHJ\RACTERISTICS : 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : 1 inear 

60 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9 : 

6 5 Thr lie Gin Ser Val lie 

1 5 



# 



• 
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10 



15 



20 

is s? 

25 



30 



n 35 

t 40 



45 



50 



55 



(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Arg Gly Phe lie Ser Ser Leu Val 
1 5 



(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE; amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

Arg Glu Thr lie Glu Ser Thr Val 
1 5 



(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

Gin Asn Phe Arg Thr Tyr lie Val Ser Phe Val 
15 10 



(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi> SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

Ser Asp Ser Asn Met Asn Met Asn Glu Leu Ser Glu Val 
15 10 



65 



(2) INFORMATION FOR SEQ ID NO: 14: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 
{ D ) TOPOLOGY : 1 inear 

5 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 14 : 

10 Pro Pro Thr Cys Ser Gin Ala Asn Ser Gly Arg lie Ser Thr Leu 

15 10 15 



15 



25 



60 



(2) INFORMATION FOR SEQ ID NO: 15: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
2 0 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 



lie Asp Leu Ala Ser Glu Phe Leu Phe Leu Ser Asn Ser Phe Leu 
15 10 15 



3 0 (2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 

3 5 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

4 0 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

Asp Ser Glu Met Tyr Asn Phe Arg Ser Gin Leu Ala Ser Val Val 
15 10 15 

4 5 (2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 

(B) TYPE : amino acid 

50 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

55 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:17: 

lie Pro Pro Asp Ser Glu Asp Gly Asn Glu Glu Gin Ser Leu Val 
15 10 15 



(2) INFORMATION FOR SEQ ID NO: 18 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 4 amino acids 
65 (B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18; 

5 Gin Ser Leu Val 

1 



10 



20 



55 



65 



(2) INFORMATION FOR SEQ ID NO: 19: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE : amino acid 

(C) STRANDEDNESS : single 
15 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 



lie Gin Ser Leu Val 
1 5 



2 5 (2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 6 amino acids 

(B) TYPE: amino acid 

3 0 (C) STRANDEDNESS: single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: peptide 

35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:20: 

Glu lie Gin Ser Leu Val 
1 5 

4 0 (2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 

45 (C) STRANDEDNESS: single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: peptide 

50 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:21: 

Asn Glu lie Gin Ser Leu Val 
1 5 



(2) INFORMATION FOR SEQ ID NO: 22: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 8 amino acids 
60 (B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 
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Arg Asn Glu lie Gin Ser Leu Val 
1 5 

5 (2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUKNCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 

10 (C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 

Asp Ser Glu Asn Ser Asn Phe Arg Asn Glu lie Gin Ser Leu Val 
15 10 15 



20 



30 



35 



50 



65 



(2) INFORMATION FOR SEQ ID NO: 24: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 7 amino acids 
2 5 (B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 

Met Gly Ala Gly Ala Thr Gly Arg Ala Met Asp Gly Pro Arg Leu Leu 
15 10 15 

Leu Leu Leu Leu Leu Gly Val Ser Leu Gly Gly Ala Lys Glu Ala Cys 
20 25 30 



Pro Thr Gly Leu Tyr Thr His Ser Gly Glu Cys Cys Lys Ala Cys Asn 
40 35 40 45 

Leu Gly Glu Gly Val Ala Gin Pro Cys Gly Ala Asn Gin Thr Val Cys 
50 55 60 

4 5 Glu Pro Cys Leu Asp Ser Val Thr Phe Ser Asp Val Val Ser Ala Thr 

65 70 75 80 



Glu Pro Cys Lys Pro Cys Thr Glu Cys Val Gly Leu Gin Ser Met Ser 
85 90 95 

Ala Pro Cys Val Glu Ala Asp Asp Ala Val Cys Arg Cys Ala Tyr Gly 
100 105 110 



Tyr Tyr Gin Asp Glu Thr Thr Gly Arg Cys Glu Ala Cys Arg Val Cys 
55 115 120 125 

Glu Ala Gly Ser Gly Leu Val Phe Ser Cys Gin Asp Lys Gin Asn Thr 
130 135 140 

6 0 Val Cys Glu Glu Cys Pro Asp Gly Thr Tyr Ser Asp Glu Ala Asn His 

145 150 155 160 



Val Asp Pro Cys Leu Pro Cys Thr Val Cys Glu Asp Thr Glu Arg Gin 
165 170 175 

Leu Arg Glu Cys Thr Arg Trp Ala Asp Ala Glu Cys Glu Glu lie Pro 
180 185 190 
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Gly Arg Trp lie Thr Arg Ser Thr Pro Pro Glu Gly Ser Asp Ser Thr 
195 200 205 

Ala Pro Ser Thr Gin Glu Pro Glu Ala Pro Pro Glu Gin Asp Leu lie 
5 210 215 220 

Ala Ser Thr Val Ala Gly Val Val Thr Thr Val Met Gly Ser Ser Gin 

225 230 235 240 

10 Pro Val Val Thr Arg Gly Thr Thr Asp Asn Leu lie Pro Val Tyr Cys 

245 250 255 



15 



30 



45 



Ser lie Leu Ala Ala Val Val Val Gly Leu Val Ala Tyr lie Ala Phe 
260 265 270 

Lys Arg Trp Asn Ser Cys Lys Gin Asn Lys Gly Gly Ala Asn Ser Arg 
275 280 285 



Pro Val Asn Gin Thr Pro Pro Pro Glu Gly Glu Lys lie His Ser Asp 
20 290 295 300 

Ser Gly lie Ser Val Asp Ser Gin Ser Leu His Asp Gin Gin Pro His 
305 310 315 320 

2 5 Thr Gin Thr Ala Ser Gly Gin Ala Leu Lys Gly Asp Gly Gly Leu Tyr 

325 330 335 



Ser Ser Leu Pro Pro Ala Lys Arg Glu Glu Val Glu Lys Leu Leu Asn 
340 345 350 

Gly Ser Ala Gly Asp Thr Trp Arg His Leu Ala Gly Glu Leu Gly Tyr 
355 360 365 



Gin Pro Glu His lie Asp Ser Phe Thr His Glu Ala Cys Pro Val Arg 
35 370 375 380 

Ala Leu Leu Ala Ser Trp Ala Thr Gin Asp Ser Ala Thr Leu Asp Ala 
385 390 395 400 

4 0 Leu Leu Ala Ala Leu Arg Arg lie Gin Arg Ala Asp Leu Val Glu Ser 

405 410 415 



Leu Cys Ser Glu Ser Thr Ala Thr Ser Pro Val 
420 425 



(2) INFORMATION FOR SEQ ID NO: 25: 



(i) SEQUENCE CHT^RACTERISTICS : 
50 (A) LENGTH: 4 58 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 

55 (ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 

Met Asn Arg Gly Val Pro Phe Arg His Leu Leu Leu Val Leu Gin Leu 
60 1 5 10 15 

Ala Leu Leu Pro Ala Ala Thr Gin Gly Lys Lys Val Val Leu Gly Lys 
20 25 30 

6 5 Lys Gly Asp Thr Val Glu Leu Thr Cys Thr Ala Ser Gin Lys Lys Ser 

35 40 45 
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lie Gin Phe His Trp Lys Asn Ser Asn Gin lie Lys lie Leu Gly Asn 
50 55 60 

Gin Gly Ser Phe Leu Thr Lys Gly Pro Ser Lys Leu Asn Asp Arg Ala 
65 70 75 80 

Asp Ser Arg Arg Ser Leu Trp Asp Gin Gly Asn Phe Pro Leu lie lie 
85 90 95 

Lys Asn Leu Lys lie Glu Asp Ser Asp Thr Tyr lie Cys Glu Val Glu 
100 105 110 

Asp Gin Lys Glu Glu Val Gin Leu Leu Val Phe Gly Leu Thr Ala Asn 
115 120 125 

Ser Asp Thr His Leu Leu Gin Gly Gin Ser Leu Thr lie Thr Leu Glu 
130 135 140 

Ser Pro Pro Gly Ser Ser Pro Ser Val Gin Cys Arg Ser Pro Arg Gly 
145 150 155 160 

Lys Asn lie Gin Gly Gly Lys Thr Leu Ser Val Ser Gin Leu Glu Leu 
165 170 175 

Gin Asp Ser Gly Thr Trp Thr Cys Thr Val Leu Gin Asn Gin Lys Lys 
180 185 190 

Val Glu Phe Lys lie Asp lie Val Val Leu Ala Phe Gin Lys Ala Ser 
195 200 205 

Ser lie Val Tyr Lys Lys Glu Gly Glu Gin Val Glu Phe Ser Phe Pro 
210 215 220 

Leu Ala Phe Thr Val Glu Lys Leu Thr Gly Ser Gly Glu Leu Trp Trp 
225 230 235 240 

Gin Ala Glu Arg Ala Ser Ser Ser Lys Ser Trp lie Thr Phe Asp Leu 
245 250 255 

Lys Asn Lys Glu Val Ser Val Lys Arg Val Thr Gin Asp Pro Lys Leu 
260 265 270 

Gin Met Gly Lys Lys Leu Pro Leu His Leu Thr Leu Pro Gin Ala Leu 
275 280 285 

Pro Gin Tyr Ala Gly Ser Gly Asn Leu Thr Leu Ala Leu Glu Ala Lys 
290 295 300 

Thr Gly Lys Leu His Gin Glu Asn Val Leu Val Val Met Arg Ala Thr 
305 310 315 320 

Gin Leu Gin Lys Asn Leu Thr Cys Glu Val Trp Gly Pro Thr Ser Pro 
325 330 335 

Lys Leu Met Leu Ser Leu Lys Leu Glu Asn Lys Glu Ala Lys Val Ser 
340 345 350 

Lys Arg Glu Lys Ala Val Trp Val Leu Asn Pro Glu Ala Gly Met Trp 
355 360 365 

Gin Cys Leu Leu Ser Asp Ser Gly Gin Val Leu Leu Glu Ser Asn lie 
370 375 380 

Lys Val Leu Pro Thr Trp Ser Thr Pro Val Gin Pro Met Ala Leu lie 
385 390 395 400 

Val Leu Gly Gly Val Ala Gly Leu Leu Leu Phe lie Gly Leu Gly lie 
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405 410 415 

Phe Phe Cys Val Arg Cys Arg His Arg Arg Arg Gin Ala Glu Arg Met 
420 425 430 

5 

Ser Gin lie Lys Arg Leu Leu Ser Glu Lys Lys Glu Cys Gin Cys Pro 
435 440 445 

His Arg Phe Gin Lys Thr Cys Ser Pro lie 
10 450 455 

(2) INFORMATION FOR SEQ ID NO: 26: 

15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 828 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

20 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:26: 

2 5 Met Asn Ser Gly Val Ala Met Lys Tyr Gly Asn Asp Ser Ser Ala Glu 

15 10 15 

Leu Ser Glu Leu His Ser Ala Ala Leu Ala Ser Leu Lys Gly Asp lie 
20 25 30 



30 



45 



60 



Val Glu Leu Asn Lys Arg Leu Gin Gin Thr Glu Arg Glu Asp Leu Leii 
35 40 45 



Glu Lys Lys Leu Ala. Lys Ala Gin Cys Glu Gin Ser His Leu Met Arg 
35 50 55 60 

Glu His Glu Asp Val Gin Glu Arg Thr Thr Leu Arg Tyr Glu Glu Arg 

65 70 75 80 

40 lie Thr Glu Leu His Ser Val lie Ala Glu Leu Asn Lys Lys lie Asp 

85 90 95 



Arg Leu Gin Gly Thr Thr lie Arg Glu Glu Asp Glu Tyr Ser Glu Leu 
100 105 110 

Arg Ser Glu Leu Ser Gin Ser Gin His Glu Val Asn Glu Asp Ser Arg 
115 120 125 



Ser Met Asp Gin Asp Gin Thr Ser Val Ser lie Pro Glu Asn Gin Ser 
50 130 135 140 

Thr Met Val Thr Ala Asp Met Asp Asn Cys Ser Asp lie Asn Ser Glu 
145 150 155 160 

55 Leu Gin Arg Val Leu Thr Gly Leu Glu Asn Val Val Cys Gly Arg Lys 

165 170 175 



Lys Ser Ser Cys Ser Leu Ser Val Ala Glu Val Asp Arg His lie Glu 
180 185 190 

Gin Leu Thr Thr Ala Ser Glu His Cys Asp Leu Ala lie Lys Thr Val 
195 200 205 



Glu Glu lie Glu Gly Val Leu Gly Arg Asp Leu Tyr Pro Asn Leu Ala 
65 210 215 220 

Glu Glu Arg Ser Arg Trp Glu Lys Glu Leu Ala Gly Leu Arg Glu Glu 
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225 230 235 240 

Asn Glu Ser Leu Thr Ala Met Leu Cys Ser Lys Glu Glu Glu Leu Asn 
245 250 255 

5 

Arg Thr Lys Ala Thr Met Asn Ala lie Arg Glu Glu Arg Asp Arg Leu 
260 265 270 

Arg Arg Arg Val Arg Glu Leu Gin Thr Arg Leu Gin Ser Val Gin Ala 
10 275 280 285 

Thr Gly Pro Ser Ser Pro Gly Arg Leu Thr Ser Thr Asn Arg Pro lie 
290 295 300 

15 Asn Pro Ser Thr Gly Glu Leu Ser Thr Ser Ser Ser Ser Asn Asp lie 

305 310 315 320 



20 



35 



50 



65 



Pro lie Ala Lys lie Ala Glu Arg Val Lys Leu Ser Lys Thr Arg Ser 

325 330 335 

Glu Ser Ser Ser Ser Asp Arg Pro Val Leu Gly Ser Glu lie Ser Ser 
340 345 350 



lie Gly Val Ser Ser Ser Val Ala Glu His Leu Ala His Ser Leu Gin 
25 355 360 365 

Asp Cys Ser Asn lie Gin Glu lie Phe Gin Thr Leu Tyr Ser His Gly 
370 375 380 

3 0 Ser Ala He Ser Glu Ser Lys He Arg Glu Phe Glu Val Glu Thr Glu 

385 390 395 400 



Arg Leu Asn Ser Arg He Glu His Leu Lys Ser Gin Asn Asp Leu Leu 
405 410 415 

Thr He Thr Leu Glu Glu Cys Lys Ser Asn Ala Glu Arg Met Ser Met 
420 425 430 



Leu Val Gly Lys Tyr Glu Ser Asn Ala Thr Ala Leu Arg Leu Ala Leu 
40 435 440 445 

Gin Tyr Ser Glu Gin Cys He Glu Ala Tyr Glu Leu Leu Leu Ala Leu 
450 455 460 

45 Ala Glu Ser Glu Gin Ser Leu He Leu Gly Gin Phe Arg Ala Ala Gly 

465 470 475 480 



Val Gly Ser Ser Pro Gly Asp Gin Ser Gly Asp Glu Asn He Thr Gin 
485 490 495 

Met Leu Lys Arg Ala His Asp Cys Arg Lys Thr Ala Glu Asn Ala Ala 
500 505 510 



Lys Ala Leu Leu Met Lys Leu Asp Gly Ser Cys Gly Gly Ala Phe Ala 
55 515 520 525 

Val Ala Gly Cys Ser Val Gin Pro Trp Glu Ser Leu Ser Ser Asn Ser 
530 535 540 

60 His Thr Ser Thr Thr Ser Ser Thr Ala Ser Ser Cys Asp Thr Glu Phe 

545 550 555 560 



Thr Lys Glu Asp Glu Gin Arg Leu Lys Asp Tyr He Gin Gin Leu Lys 
565 570 575 

Asn Asp Arg Ala Ala Val Lys Leu Thr Met Leu Glu Leu Glu Ser He 
580 585 590 
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His lie Asp Pro Leu Ser Tyx Asp Val Lys Pro Arg Gly Asp Ser Gin 

595 600 605 

Arg Leu Asp Leu Glu Asn Ala Val Leu Met Gin Glu Leu Met Ala Met 

5 610 615 620 

Lys Glu Glu Met Ala Glu Leu Lys Ala Gin Leu Tyr Leu Leu Glu Lys 

625 630 635 640 

10 Glu Lys Lys Ala Leu Glu Leu Lys Leu Ser Thr Arg Glu Ala Gin Glu 

645 650 655 



15 



30 



45 



Gin Ala Tyr Leu Val His lie Glu His Leu Lys Ser Glu Val Glu Glu 
660 665 670 

Gin Lys Glu Gin Arg Met Arg Ser Leu Ser Ser Thr Ser Ser Gly Ser 
675 680 685 



Lys Asp Lys Pro Gly Lys Glu Cys Ala Asp Ala Ala Ser Pro Ala Leu 
20 690 695 700 

Ser Leu Ala Glu Leu Arg Thr Thr Cys Ser Glu Asn Glu Leu Ala Ala 

705 710 715 720 

2 5 Glu Phe Thr Asn Ala lie Arg Arg Glu Lys Lys Leu Lys Ala Arg Val 

725 730 735 



Gin Glu Leu Val Ser Ala Leu Glu Arg Leu Thr Lys Ser Ser Glu lie 

740 745 750 

Arg His Gin Gin Ser Ala Glu Phe Val Asn Asp Leu Lys Arg Ala Asn 

755 760 765 



Ser Asn Leu Val Ala Ala Tyr Glu Lys Ala Lys Lys Lys His Gin Asn 
35 770 775 780 

Lys Leu Lys Lys Leu Glu Ser Gin Met Met Ala Met Val Glu Arg His 
785 790 795 800 

4 0 Glu Thr Gin Val Arg Met Leu Lys Gin Arg lie Ala Leu Leu Glu Glu 

805 810 815 



Glu Asn Ser Arg Pro His Thr Asn Glu Thr Ser Leu 
820 825 



(2) INFORMATION FOR SEQ ID NO: 27: 



(i) SEQUENCE CHARACTERISTICS: 

5 0 (A) LENGTH: 672 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

55 (ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:27: 

Met Ala Asp Val Phe Pro Gly Asn Asp Ser Thr Ala Ser Gin Asp Val 
60 1 5 10 15 

Ala Asn Arg Phe Ala Arg Lys Gly Ala Leu Arg Gin Lys Asn Val His 
20 25 30 

6 5 Glu Val Lys Asp His Lys Phe He Ala Arg Phe Phe Lys Gin Pro Thr 

35 40 45 
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Phe Cys Ser His Cys Thr Asp Phe lie Trp Gly Phe Gly Lys Gly Gly 
50 55 60 

Phe Gin Cys Gin Val Cys Cys Phe Val Val His Lys Arg Cys His Glu 
5 . 65 70 75 80 

Phe Val Thr Phe Ser Cys Pro Gly Ala Asp Lys Gly Pro Asp Thr Asp 
85 90 95 

10 Asp Pro Arg Ser Lys His Lys Phe Lys lie His Thr Tyr Gly Ser Pro 

100 105 110 



15 



30 



45 



60 



Thr Phe Cys Asp His Cys Gly Ser Leu Leu Tyr Gly Leu lie His Gin 
115 120 125 

Gly Met Lys Cys Asp Thr Cys Asp Met Asn Val His Lys Gin Cys Val 
130 135 140 



lie Asn Val Pro Ser Leu Cys Gly Met Asp His Thr Glu Lys Arg Gly 

20 145 150 155 160 

Arg lie Tyr Leu Lys Ala Glu Val Ala Asp Glu Lys Leu His Val Thr 

165 170 175 

25 Val Arg Asp Ala Lys Asn Leu lie Pro Met Asp Pro Asn Gly Leu Ser 

180 185 190 



Asp Pro Tyr Val Lys Leu Lys Leu lie Pro Asp Pro Lys Asn Glu Ser 

195 200 205 

Lys Gin Lys Thr Lys Thr lie Arg . Ser Thr Leu Asn Pro Gin Trp Asn 

210 215 220 



Glu Ser Phe Thr Phe Lys Leu Lys Pro Ser Asp Lys Asp Arg Arg Leu 
35 225 230 235 240 

Ser Val Glu lie Trp Asp Trp Asp Arg Thr Thr Arg Asn Asp Phe Met 
245 250 255 

4 0 Gly Ser Leu Ser Phe Gly Val Ser Glu Leu Met Lys Met Pro Ala Ser 

260 265 270 

Gly Trp Tyr Lys Leu Leu Asn Gin Glu Glu Gly Glu Tyr Tyr Asn Val 
275 280 285 



Pro lie Pro Glu Gly Asp Glu Glu Gly Asn Met Glu Leu Arg Gin Lys 
290 295 300 



Phe Glu Lys Ala Lys Leu Gly Pro Ala Gly Asn Lys Val lie Ser Pro 
50 305 310 315 320 

Ser Glu Asp Arg Lys Gin Pro Ser Asn Asn Leu Asp Arg Val Lys Leu 
325 330 335 

55 Thr Asp Phe Asn Phe Leu Met Val Leu Gly Lys Gly Ser Phe Gly Lys 

340 345 350 

Val Met Leu Ala Asp Arg Lys Gly Thr Glu Glu Leu Tyr Ala lie Lys 
355 360 365 



lie Leu Lys Lys Asp Val Val He Gin Asp Asp Asp Val Glu Cys Thr 
370 375 380 



Met Val Glu Lys Arg Val Leu Ala Leu Leu Asp Lys Pro Pro Phe Leu 
65 385 390 395 400 

Thr Gin Leu His Ser Cys Phe Gin Thr Val Asp Arg Leu Tyr Phe Val 
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405 410 415 

Met Glu Tyr Val Asn Gly Gly Asp Leu Met Tyr His He Gin Gin Val 
420 425 430 

5 

Gly Lys Phe Lys Glu Pro Gin Ala Val Phe Tyr Ala Ala Glu He Ser 
435 440 445 

He Gly Leu Phe Phe Leu His Lys Arg Gly He He Tyr Arg Asp Leu 
10 450 455 460 

Lys Leu Asp Asn Val Met Leu Asp Ser Glu Gly His He Lys He Ala 
465 470 475 480 

15 Asp Phe Gly Met Cys Lys Glu His Met Met Asp Gly Val Thr Thr Arg 

485 490 495 



20 



35 



50 



Thr Phe Cys Gly Thr Pro Asp Tyr He Ala Pro Glu He He Ala Tyr 
500 505 510 

Gin Pro Tyr Gly Lys Ser Val Asp Trp Trp Ala Tyr Gly Val Leu Leu 

515 520 525 



Tyr Glu Met Leu Ala Gly Gin Pro Pro Phe Asp Gly Glu Asp Glu Asp 
25 530 535 540 

Glu Leu Phe Gin Ser He Met Glu His Asn Val Ser Tyr Pro Lys Ser 

545 550 555 560 

3 0 Leu Ser Lys Glu Ala Val Ser He Cys Lys Gly Leu Met Thr Lys His 

565 570 575 



Pro Ala Lys Arg Leu Gly Cys Gly Pro Glu Gly Glu Arg Asp Val Arg 
580 585 590 

Glu His Ala Phe Phe Arg Arg He Asp Trp Glu Lys Leu Glu Asn Arg 
595 600 605 



Glu He Gin Pro Pro Phe Lys Pro Lys Val Cys Gly Lys Gly Ala Glu 
40 610 615 620 

Asn Phe Asp Lys Phe Phe Thr Arg Gly Gin Pro Val Leu Thr Pro Pro 

625 630 635 640 

4 5 Asp Gin Leu Val He Ala Asn He Asp Gin Ser Asp Phe Glu Gly Phe 

645 650 655 



Ser Tyr Val Asn Pro Gin Phe Val His Pro He Leu Gin Ser Ala Val 
660 665 670 



{ 2 ) INFORMATION FOR SEQ ID NO : 2 8 : 



55 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 71 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

60 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:28: 

65 Met Asp He Leu Cys Glu Glu Asn Thr Ser Leu Ser Ser Thr Thr Asn 

15 10 15 
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Ser Leu Met Gin Leu Asn Asp Asp Thr Arg Leu Tyr Ser Asn Asp Phe 
20 25 30 

Asn Ser Gly Glu Ala Asn Thr Ser Asp Ala Phe Asn Trp Thr Val Asp 
5 35 40 45 

Ser Glu Asn Arg Thr Asn Leu Ser Cys Glu Gly Cys Leu Ser Pro Ser 
50 55 60 

10 Cys Leu Ser Leu Leu His Leu Gin Glu Lys Asn Trp Ser Ala Leu Leu 

65 70 75 80 



15 



30 



45 



60 



Thr Ala Val Val lie He Leu Thr He Ala Gly Asn He Leu Val He 
85 90 95 

Met Ala Val Ser Leu Glu Lys Lys Leu Gin Asn Ala Thr Asn Tyr Phe 
100 105 110 



Leu Met Ser Leu Ala He Ala Asp Met Leu Leu Gly Phe Leu Val Met 

20 115 120 125 

Pro Val Ser Met Leu Thr He Leu Tyr Gly Tyr Arg Trp Pro Leu Pro 

130 135 140 

2 5 Ser Lys Leu Cys Ala Val Trp He Tyr Leu Asp Val Leu Phe Ser Thr 

145 150 155 160 



Ala Ser He Met His Leu Cys Ala He Ser Leu Asp Arg Tyr Val Ala 

165 170 175 

He Gin Asn Pro He His His Ser Arg Phe Asn Ser Arg Thr Lys Ala 

180 185 190 



Phe Leu Lys He He Ala Val Trp Thr He Ser Val Gly He Ser Met 
35 195 200 205 

Pro He Pro Val Phe Gly Leu Gin Asp Asp Ser Lys Val Phe Lys Glu 
210 215 220 

4 0 Gly Ser Cys Leu Leu Ala Asp Asp Asn Phe Val Leu He Gly Ser Phe 

225 230 235 240 



Val Ser Phe Phe He Pro Leu Thr He Met Val He Thr Tyr Phe Leu 
245 250 255 

Thr He Lys Ser Leu Gin Lys Glu Ala Thr Leu Cys Val Ser Asp Leu 
260 265 270 



Gly Thr Arg Ala Lys Leu Ala Ser Phe Ser Phe Leu Pro Gin Ser Ser 
50 275 280 285 

Leu Ser Ser Glu Lys Leu Phe Gin Arg Ser He His Arg Glu Pro Gly 
290 295 300 

55 Ser Tyr Thr Gly Arg Arg Thr Met Gin Ser He Ser Asn Glu Gin Lys 

305 310 315 320 

Ala Cys Lys Val Leu Gly He Val Phe Phe Leu Phe Val Val Met Trp 
325 330 335 



Cys Pro Phe Phe He Thr Asn He Met Ala Val He Cys Lys Glu Ser 
340 345 350 



Cys Asn Glu Asp Val He Gly Ala Leu Leu Asn Val Phe Val Trp He 
65 355 360 365 

Gly Tyr Leu Ser Ser Ala Val Asn Pro Leu Val Tyr Thr Leu Phe Asn 



# 
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370 375 380 

Lys Thr Tyr Arg Ser Ala Phe Ser Arg Tyr lie Gin Cys Gin Tyr Lys 
385 390 395 400 

5 

Glu Asn Lys Lys Pro Leu Gin Leu lie Leu Val Asn Thr lie Pro Ala 
405 410 415 

Leu Ala Tyr Lys Ser Ser Gin Leu Gin Met Gly Gin Lys Lys Asn Ser 
10 420 425 430 

Lys Gin Asp Ala Lys Thr Thr Asp Asn Asp Cys Ser Met Val Ala Leu 
435 440 445 

15 Gly Lys Gin His Ser Glu Glu Ala Ser Lys Asp Asn Ser Asp Gly Val 

450 455 460 



20 



45 



60 



Asn Glu Lys Val Ser Cys Val 
465 470 



(2) INFORMATION FOR SEQ ID NO: 29: 



(i) SEQUENCE CHARACTERISTICS: 
25 (A) LENGTH: 481 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

3 0 (ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 

Met Ala Leu Ser Tyr Arg Val Ser Glu Leu Gin Ser Thr lie Pro Glu 
35 1 5 10 15 

His lie Leu Gin Ser Thr Phe Val His Val He Ser Ser Asn Trp Ser 
20 25 30 

4 0 Gly Leu Gin Thr Glu Ser He Pro Glu Glu Met Lys Gin He Val Glu 

35 40 45 



Glu Gin Gly Asn Lys Leu His Trp Ala Ala Leu Leu He Leu Met Val 

50 55 60 

He He Pro Thr He Gly Gly Asn Thr Leu Val He Leu Ala Val Ser 

65 70 75 80 



Leu Glu Lys Lys Leu Gin Tyr Ala Thr Asn Tyr Phe Leu Met Ser Leu 

50 85 90 95 

Ala Val Ala Asp Leu Leu Val Gly Leu Phe Val Met Pro He Ala Leu 
100 105 110 

55 Leu Thr He Met Phe Glu Ala Met Trp Pro Leu Pro Leu Val Leu Cys 

115 120 125 



Pro Ala Trp Leu Phe Leu Asp Val Leu Phe Ser Thr Ala Ser He Met 

130 135 140 

His Leu Cys Ala He Ser Val Asp Arg Tyr He Ala He Lys Lys Pro 

145 150 155 160 



He Gin Ala Asn Gin Tyr Asn Ser Arg Ala Thr Ala Phe He Lys He 
65 165 170 175 

Thr Val Val Trp Leu He Ser He Gly He Ala He Pro Val Pro He 



# 
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180 185 190 

Lys Gly lie Glu Thr Asp Val Asp Asn Pro Asn Asn lie Thr Cys Val 
195 200 205 

5 

Leu Thr Lys Glu Arg Phe Gly Asp Phe Met Leu Phe Gly Ser Leu Ala 
210 215 220 

Ala Phe Phe Thr Pro Leu Ala lie Met lie Val Thr Tyr Phe Leu Thr 
10 225 230 235 240 

lie His Ala Leu Gin Lys Lys Ala Tyr Leu Val Lys Asn Lys Pro Pro 
245 250 255 

15 Gin Arg Leu Thr Trp Leu Thr Val Ser Thr Val Phe Gin Arg Asp Glu 

260 265 270 



20 



35 



50 



Thr Pro Cys Ser Ser Pro Glu Lys Val Ala Met Leu Asp Gly Ser Arg 
275 280 285 

Lys Asp Lys Ala Leu Pro Asn Ser Gly Asp Glu Thr Leu Met Arg Arg 
290 295 300 



Thr Ser Thr lie Gly Lys Lys Ser Val Gin Thr lie Ser Asn Glu Gin 
25 305 310 315 320 

Arg Ala Ser Lys Val Leu Gly lie Val Phe Phe Leu Phe Leu Leu Met 
325 330 335 

3 0 Trp Cys Pro Phe Phe lie Thr Asn lie Thr Leu Val Leu Cys Asp Ser 

340 345 350 

Cys Asn Gin Thr Thr Leu Gin Met Leu Leu Glu lie Phe Val Trp lie 
355 360 365 



Gly Tyr Val Ser Ser Gly Val Asn Pro Leu Val Tyr Thr Leu Phe Asn 
370 375 380 



Lys Thr Phe Arg Asp Ala Phe Gly Arg Tyr lie Thr Cys Asn Tyr Arg 

40 385 390 395 400 

Ala Thr Lys Ser Val Lys Thr Leu Arg Lys Arg Ser Ser Lys lie Tyr 

405 410 415 

4 5 Phe Arg Asn Pro Met Ala Glu Asn Ser Lys Phe Phe Lys Lys His Gly 

420 425 430 



lie Arg Asn Gly lie Asn Pro Ala Met Tyr Gin Ser Pro Met Arg Leu 
435 440 445 

Arg Ser Ser Thr lie Gin Ser Ser Ser lie lie Leu Leu Asp Thr Leu 
450 455 460 



Leu Leu Thr Glu Asn Glu Gly Asp Lys Thr Glu Glu Gin Val Ser Val 
55 465 470 475 480 

Val 

60 (2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 284 3 amino acids 

(B) TYPE: amino acid 

6 5 (C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 
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(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 

5 Met Ala Ala Ala Ser Tyr Asp Gin Leu Leu Lys Gin Val Glu Ala Leu 

15 10 15 



10 



25 



40 



55 



Lys Met Glu Asn Ser Asn Leu Arg Gin Glu Leu Glu Asp Asn Ser Asn 
20 25 30 

His Leu Thr Lys Leu Glu Thr Glu Ala Ser Asn Met Lys Glu Val Leu 
35 40 45 



Lys Gin Leu Gin Gly Ser lie Glu Asp Glu Ala Met Ala Ser Ser Gly 
15 50 55 60 

Gin lie Asp Leu Leu Glu Arg Leu Lys Glu Leu Asn Leu Asp Ser Ser 
65 70 75 80 

2 0 Asn Phe Pro Gly Val Lys Leu Arg Ser Lys Met Ser Leu Arg Ser Tyr 

85 90 95 



Gly Ser Arg Glu Gly Ser Val Ser Ser Arg Ser Gly Glu Cys Ser Pro 
100 105 110 

Val Pro Met Gly Ser Phe Pro Arg Arg Gly Phe Val Asn Gly Ser Arg 
115 120 125 



Glu Ser Thr Gly Tyr Leu Glu Glu Leu Glu Lys Glu Arg Ser Leu Leu 

30 130 135 140 

Leu Ala Asp Leu Asp Lys Glu Glu Lys Glu Lys Asp Trp Tyr Tyr Ala 

145 150 155 160 

3 5 Gin Leu Gin Asn Leu Thr Lys Arg lie Asp Ser Leu Pro Leu Thr Glu 

165 170 175 



Asn Phe Ser Leu Gin Thr Asp Met Thr Arg Arg Gin Leu Glu Tyr Glu 

180 185 190 

Ala Arg Gin lie Arg Val Ala Met Glu Glu Gin Leu Gly Thr Cys Gin 
195 200 205 



Asp Met Glu Lys Arg Ala Gin Arg Arg lie Ala Arg lie Gin Gin lie 
45 210 215 220 

Glu Lys Asp lie Leu Arg lie Arg Gin Leu Leu Gin Ser Gin Ala Thr 
225 230 235 240 

50 Glu Ala Glu Arg Ser Ser Gin Asn Lys His Glu Thr Gly Ser His Asp 

245 250 255 



Ala Glu Arg Gin Asn Glu Gly Gin Gly Val Gly Glu lie Asn Met Ala 
260 265 270 

Thr Ser Gly Asn Gly Gin Gly Ser Thr Thr Arg Met Asp His Glu Thr 
275 280 285 



Ala Ser Val Leu Ser Ser Ser Ser Thr His Ser Ala Pro Arg Arg Leu 
60 290 295 300 

Thr Ser His Leu Gly Thr Lys Val Glu Met Val Tyr Ser Leu Leu Ser 
305 310 315 320 

65 Met Leu Gly Thr His Asp Lys Asp Asp Met Ser Arg Thr Leu Leu Ala 

325 330 335 




wo 98/05347 PCT/US97/12677 

-54- 

Met Ser Ser Ser Gin Asp Ser Cys lie Ser Met Arg Gin Ser Gly Cys 
340 345 350 

Leu Pro beu Leu lie Gin Leu Leu His Gly Asn Asp Lys Asp Ser Val 
5 355 360 365 

Leu Leu Gly Asn Ser Arg Gly Ser Lys Glu Ala Arg Ala Arg Ala Ser 
370 375 380 

10 Ala Ala Leu His Asn lie lie His Ser Gin Pro Asp Asp Lys Arg Gly 

385 390 395 . 400 



15 



30 



45 



60 



Arg Arg Glu lie Arg Val Leu His Leu Leu Glu Gin lie Arg Ala Tyr 

405 410 415 

Cys Ser Thr Cys Trp Glu Trp Gin Glu Ala His Glu Pro Gly Met Asp 
420 425 430 



Gin Asp Lys Asn Pro Met Pro Ala Pro Val Glu His Gin lie Cys Pro 
20 435 440 445 

Ala Val Cys Val Leu Met Lys Leu Ser Phe Asp Glu Glu His Arg His 
450 455 460 

2 5 Ala Met Asn Glu Leu Gly Gly Leu Gin Ala lie Ala Glu Leu Leu Gin 

465 470 475 480 



Val Asp Cys Glu Met Tyr Gly Leu Thr Asn Asp His Tyr Ser lie Thr 

485 490 495 

Leu Arg Arg Tyr Ala Gly Met Ala Leu Thr Asn Leu Thr Phe Gly Asp 

500 505 510 



Val Ala Asn Lys Ala Thr Leu Cys Ser Met Lys Gly Cys Met Arg Ala 
35 515 520 525 

Leu Val Ala Gin Leu Lys Ser Glu Ser Glu Asp Leu Gin Gin Val lie 
530 535 540 

4 0 Ala Ser Val Leu Arg Asn Leu Ser Trp Arg Ala Asp Val Asn Ser Lys 

545 550 555 560 



Lys Thr Leu Arg Glu Val Gly Ser Val Lys Ala Leu Met Glu Cys Ala 
565 570 575 

Leu Glu Val Lys Lys Glu Ser Thr Leu Lys Ser Val Leu Ser Ala Leu 
580 585 590 



Trp Asn Leu Ser Ala His Cys Thr Glu Asn Lys Ala Asp lie Cys Ala 

50 595 600 605 

Val Asp Gly Ala Leu Ala Phe Leu Val Gly Thr Leu Thr Tyr Arg Ser 
610 615 620 

55 Gin Thr Asn Thr Leu Ala lie lie Glu Ser Gly Gly Gly lie Leu Arg 

625 630 635 640 



Asn Val Ser Ser Leu lie Ala Thr Asn Glu Asp His Arg Gin lie Leu 

645 650 655 

Arg Glu Asn Asn Cys Leu Gin Thr Leu Leu Gin His Leu Lys Ser His 
660 665 670 



Ser Leu Thr lie Val Ser Asn Ala Cys Gly Thr Leu Trp Asn Leu Ser 
65 675 680 685 

Ala Arg Asn Pro Lys Asp Gin Glu Ala Leu Trp Asp Met Gly Ala Val 
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690 695 700 

Ser Met Leu Lys Asn Leu lie His Ser Lys His Lys Met lie Ala Met 
705 710 715 720 

5 

Gly Ser Ala Ala Ala Leu Arg Asn Leu Met Ala Asn Arg Pro Ala Lys 
725 730 735 

Tyr Lys Asp Ala Asn lie Met Ser Pro Gly Ser Ser Leu Pro Ser Leu 
10 740 745 750 

His Val Arg Lys Gin Lys Ala Leu Glu Ala Glu Leu Asp Ala Gin His 
755 760 765 

15 Leu Ser Glu Thr Phe Asp Asn lie Asp Asn lie Ser Pro Lys Ala Ser 

770 775 780 



20 



35 



50 



65 



His Arg Ser Lys Gin Arg His Lys Gin Ser Leu Tyr Gly Asp Tyr Val 

785 790 795 800 

Phe Asp Thr Asn Arg His Asp Asp Asn Arg Ser Asp Asn Phe Asn Thr 

805 810 815 



Gly Asn Met Thr Val Leu Ser Pro Tyr Leu Asn Thr Thr Val Leu Pro 

25 820 825 830 

Ser Ser Ser Ser Ser Arg Gly Ser Leu Asp Ser Ser Arg Ser Glu Lys 

835 840 ^- 845 

3 0 Asp Arg Ser Leu Glu Arg Glu Arg Gly lie Gly Leu Gly Asn Tyr His 

850 855 860 



Pro Ala Thr Glu Asn Pro Gly Thr Ser Ser Lys Arg Gly Leu Gin lie 
865 870 875 880 

Ser Thr Thr Ala Ala Gin lie Ala Lys Val Met Glu Glu Val Ser Ala 
885 890 895 



lie His Thr Ser Gin Glu Asp Arg Ser Ser Gly Ser Thr Thr Glu Leu 
40 900 905 910 

His Cys Val Thr Asp Glu Arg Asn Ala Leu Arg Arg Ser Ser Ala Ala 
915 920 925 

45 His Thr His Ser Asn Thr Tyr Asn Phe Thr Lys Ser Glu Asn Ser Asn 

930 935 940 



Arg Thr Cys Ser Met Pro Tyr Ala Lys Leu Glu Tyr Lys Arg Ser Ser 

945 950 955 960 

Asn Asp Ser Leu Asn Ser Val Ser Ser Ser Asp Gly Tyr Gly Lys Arg 

965 970 975 



Gly Gin Met Lys Pro Ser lie Glu Ser Tyr Ser Glu Asp Asp Glu Ser 
55 980 985 990 

Lys Phe Cys Ser Tyr Gly Gin Tyr Pro Ala Asp Leu Ala His Lys lie 
995 1000 1005 

60 His Ser Ala Asn His Met Asp Asp Asn Asp Gly Glu Leu Asp Thr Pro 

1010 1015 1020 



lie Asn Tyr Ser Leu Lys Tyr Ser Asp Glu Gin Leu Asn Ser Gly Arg 
1025 1030 1035 1040 

Gin Ser Pro Ser Gin Asn Glu Arg Trp Ala Arg Pro Lys His lie lie 
1045 1050 1055 
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Glu Asp Glu lie Lys Gin Ser Glu Gin Arg Gin Ser Arg Asn Gin Ser 
1060 1065 1070 

Thr Thr Tyr Pro Val Tyr Thr Glu Ser Thr Asp Asp Lys His Leu Lys 
5 1075 1080 1085 

Phe Gin Pro His Phe Gly Gin Gin Glu Cys Val Ser Pro Tyr Arg Ser 
1090 1095 1100 

10 Arg Gly Ala Asn Gly Ser Glu Thr Asn Arg Val Gly Ser Asn His Gly 

1105 1110 1115 1120 



15 



30 



45 



60 



lie Asn Gin Asn Val Ser Gin Ser Leu Cys Gin Glu Asp Asp Tyr Glu 
1125 1130 1135 

Asp Asp Lys Pro Thr Asn Tyr Ser Glu Arg Tyr Ser Glu Glu Glu Gin 
1140 1145 1150 



His Glu Glu. Glu Glu Arg Pro Thr Asn Tyr Ser lie Lys Tyr Asn Glu 
20 1155 1160 1165 

Glu Lys Arg His Val Asp Gin Pro lie Asp Tyr Ser lie Leu Lys Ala 
1170 1175 1180 

25 Thr Asp lie Pro Ser Ser Gin Lys Gin Ser Phe Ser Phe Ser Lys Ser 

1185 1190 1195 1200 



Ser Ser Gly Gin Ser Ser Lys Thr Glu His Met Ser Ser Ser Ser Glu 
1205 1210 1215 

Asn Thr Ser Thr Pro Ser Ser Asn Ala Lys Arg Gin Asn Gin Leu His 
1220 1225 1230 



Pro Ser Ser Ala Gin Ser Arg Ser Gly Gin Pro Gin Lys Ala Ala Thr 
35 1235 1240 1245 

Cys Lys Val Ser Ser lie Asn Gin Glu Thr lie Gin Thr Tyr Cys Val 
1250 1255 1260 

40 Glu Asp Thr Pro lie Cys Phe Ser Arg Cys Ser Ser Leu Ser Ser Leu 

1265 1270 1275 1280 



Ser Ser Ala Glu Asp Glu lie Gly Cys Asn Gin Thr Thr Gin Glu Ala 
1285 1290 1295 

Asp Ser Ala Asn Thr Leu Gin lie Ala Glu lie Lys Glu Lys lie Gly 
1300 1305 1310 



Thr Arg Ser Ala Glu Asp Pro Val Ser Glu Val Pro Ala Val Ser Gin 
50 1315 1320 1325 

His Pro Arg Thr Lys Ser Ser Arg Leu Gin Gly Ser Ser Leu Ser Ser 
1330 1335 1340 

55 Glu Ser Ala Arg His Lys Ala Val Glu Phe Ser Ser Gly Ala Lys Ser 

1345 1350 1355 1360 



Pro Ser Lys Ser Gly Ala Gin Thr Pro Lys Ser Pro Pro Glu His Tyr 
1365 1370 1375 

Val Gin Glu Thr Pro Leu Met Phe Ser Arg Cys Thr Ser Val Ser Ser 
1380 1385 1390 



Leu Asp Ser Phe Glu Ser Arg Ser lie Ala Ser Ser Val Gin Ser Glu 
65 1395 1400 1405 

Pro Cys Ser Gly Met Val Ser Gly lie lie Ser Pro Ser Asp Leu Pro 
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1410 1415 1420 

Asp Ser Pro Gly Gin Thr Met Pro Pro Ser Arg Ser Lys Thr Pro Pro 
1425 1430 1435 1440 

5 

Pro Pro Pro Gin Thr Ala Gin Thr Lys Arg Glu Val Pro Lys Asn Lys 
1445 1450 1455 

Ala Pro Thr Ala Glu Lys Arg Glu Ser Gly Pro Lys Gin Ala Ala Val 
10 1460 1465 1470 

Asn Ala Ala Val Gin Arg Val Gin Val Leu Pro Asp Ala Asp Thr Leu 
1475 1480 1485 

15 Leu His Phe Ala Thr Glu Ser Thr Pro Asp Gly Phe Ser Cys Ser Ser 

1490 1495 1500 



20 



35 



50 



65 



Ser Leu Ser Ala Leu Ser Leu Asp Glu Pro Phe lie Gin Lys Asp Val 
1505 1510 1515 1520 

Glu Leu Arg lie Met Pro Pro Val Gin Glu Asn Asp Asn Gly Asn Glu 
1525 1530 1535 



Thr Glu Ser Glu Gin Pro Lys Glu Ser Asn Glu Asn Gin Glu Lys Glu 
25 1540 1545 1550 

Ala Glu Lys Thr lie Asp Ser Glu Lys Asp Leu Leu Asp Asp Ser Asp 
1555 1560 1565 

3 0 Asp Asp Asp He Glu He Leu Glu Glu Cys He He Ser Ala Met Pro 

1570 1575 1580 

Thr Lys Ser Ser Arg Lys Ala Lys Lys Pro Ala Gin Thr Ala Ser Lys 
1585 1590 1595 1600 



Leu Pro Pro Pro Val Ala Arg Lys Pro Ser Gin Leu Pro Val Tyr Lys 
1605 1610 1615 



Leu Leu Pro Ser Gin Asn Arg Leu Gin Pro Gin Lys His Val Ser Phe 
40 1620 1625 1630 

Thr Pro Gly Asp Asp Met Pro Arg Val Tyr Cys Val Glu Gly Thr Pro 
1635 1640 1645 

4 5 He Asn Phe Ser Thr Ala Thr Ser Leu Ser Asp Leu Thr He Glu Ser 

1650 1655 1660 



Pro Pro Asn Glu Leu Ala Ala Gly Glu Gly Val Arg Gly Gly Ala Gin 
1665 1670 1675 1680 

Ser Gly Glu Phe Glu Lys Arg Asp Thr He Pro Thr Glu Gly Arg Ser 
1685 1690 1695 



Thr Asp Glu Ala Gin Gly Gly Lys Thr Ser Ser Val Thr He Pro Glu 
55 1700 1705 1710 

Leu Asp Asp Asn Lys Ala Glu Glu Gly Asp He Leu Ala Glu Cys He 
1715 1720 1725 

6 0 Asn Ser Ala Met Pro Lys Gly Lys Ser His Lys Pro Phe Arg Val Lys 

1730 1735 1740 



Lys He Met Asp Gin Val Gin Gin Ala Ser Ala Ser Ser Ser Ala Pro 
1745 1750 1755 1760 

Asn Lys Asn Gin Leu Asp Gly Lys Lys Lys Lys Pro Thr Ser Pro Val 
1765 1770 1775 



# 
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Lys Pro lie Pro Gin Asn Thr Glu Tyr Arg Thr Arg Val Arg Lys Asn 
1780 1785 1790 

Ala Asp Ser Lys Asn Asn Leu Asn Ala Glu Arg Val Phe Ser Asp Asn 
5 1795 1800 1805 

Lys Asp Ser Lys Lys Gin Asn Leu Lys Asn Asn Ser Lys Asp Phe Asn 
1810 1815 1820 

10 Asp Lys Leu Pro Asn Asn Glu Asp Arg Val Arg Gly Ser Phe Ala Phe 

1825 1830 1835 1840 



15 



30 



45 



60 



Asp Ser Pro His His Tyr Thr Pro He Glu Gly Thr Pro Tyr Cys Phe 
1845 1850 1855 

Ser Arg Asn Asp Ser Leu Ser Ser Leu Asp Phe Asp Asp Asp Asp Val 
1860 1865 1870 



Asp Leu Ser Arg Glu Lys Ala Glu Leu Arg Lys Ala Lys Glu Asn Lys 
20 1875 1880 1885 

Glu Ser Glu Ala Lys Val Thr Ser His Thr Glu Leu Thr Ser Asn Gin 
1890 1895 1900 

25 Gin Ser Ala Asn Lys Thr Gin Ala He Ala Lys Gin Pro He Asn Arg 

1905 1910 1915 1920 



Gly Gin Pro Lys Pro He Leu Gin Lys Gin Ser Thr Phe Pro Gin Ser 
1925 1930 1935 

Ser Lys Asp He Pro Asp Arg Gly Ala Ala Thr Asp Glu Lys Leu Gin 
1940 1945 1950 



Asn Phe Ala He Glu Asn Thr Pro Val Cys Phe Ser His Asn Ser Ser 
35 1955 1960 1965 

Leu Ser Ser Leu Ser Asp He Asp Gin Glu Asn Asn Asn Lys Glu Asn 
1970 1975 1980 

4 0 Glu Pro He Lys Glu Thr Glu Pro Pro Asp Ser Gin Gly Glu Pro Ser 

1985 1990 1995 2000 



Lys Pro Gin Ala Ser Gly Tyr Ala Pro Lys Ser Phe His Val Glu Asp 
2005 2010 2015 

Thr Pro Val Cys Phe Ser Arg Asn Ser Ser Leu Ser Ser Leu Ser He 
2020 2025 2030 



Asp Ser Glu Asp Asp Leu Leu Gin Glu Cys He Ser Ser Ala Met Pro 
50 2035 2040 2045 

Lys Lys Lys Lys Pro Ser Arg Leu Lys Gly Asp Asn Glu Lys His Ser 
2050 2055 2060 

55 Pro Arg Asn Met Gly Gly He Leu Gly Glu Asp Leu Thr Leu Asp Leu 

2065 2070 2075 2080 

Lys Asp He Gin Arg Pro Asp Ser Glu His Gly Leu Ser Pro Asp Ser 
2085 2090 2095 



Glu Asn Phe Asp Trp Lys Ala He Gin Glu Gly Ala Asn Ser He Val 
2100 2105 2110 



Ser Ser Leu His Gin Ala Ala Ala Ala Ala Cys Leu Ser Arg Gin Ala 
65 2115 2120 2125 

Ser Ser Asp Ser Asp Ser He Leu Ser Leu Lys Ser Gly He Ser Leu 
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2X30 2135 2140 

Gly Ser Pro Phe His Leu Thr Pro Asp Gin Glu Glu Lys Pro Phe Thr 
2145 2150 2155 2160 

5 

Ser Asn Lys Gly Pro Arg lie Leu Lys Pro Gly Glu Lys Ser Thr Leu 
2165 2170 2175 

Glu Thr Lys Lys lie Glu Ser Glu Ser Lys Gly lie Lys Gly Gly Lys 
10 2180 2185 2190 

Lys Val Tyr Lys Ser Leu lie Thr Gly Lys Val Arg Ser Asn Ser Glu 
2195 2200 2205 

15 lie Ser Gly Gin Met Lys Gin Pro Leu Gin Ala Asn Met Pro Ser lie 

2210 2215 2220 



20 



35 



50 



65 



Ser Arg Gly Arg Thr Met lie His lie Pro Gly Val Arg Asn Ser Ser 
2225 2230 2235 2240 

Ser Ser Thr Ser Pro Val Ser Lys Lys Gly Pro Pro Leu Lys Thr Pro 
2245 2250 2255 



Ala Ser Lys Ser Pro Ser Glu Gly Gin Thr Ala Thr Thr Ser Pro Arg 
25 2260 2265 2270 

Gly Ala Lys Pro Ser Val Lys Ser Glu Leu Ser Pro Val Ala Arg Gin 
2275 2280 2285 

3 0 Thr Ser Gin He Gly Gly Ser Ser Lys Ala Pro Ser Arg Ser Gly Ser 

2290 2295 2300 



Arg Asp Ser Thr Pro Ser Arg Pro Ala Gin Gin Pro Leu Ser Arg Pro 
2305 2310 2315 2320 

He Gin Ser Pro Gly Arg Asn Ser He Ser Pro Gly Arg Asn Gly He 

2325 2330 2335 



Ser Pro Pro Asn Lys He Ser Gin Leu Pro Arg Thr Ser Ser Pro Ser 
40 2340 2345 2350 

Thr Ala Ser Thr Lys Ser Ser Gly Ser Gly Lys Met Ser Tyr Thr Ser 
2355 2360 2365 

45 Pro Gly Arg Gin Met Ser Gin Gin Asn Leu Thr Lys Gin Thr Gly Leu 

2370 2375 2380 



Ser Lys Asn Ala Ser Ser He Pro Arg Ser Glu Ser Ala Ser Lys Gly 
2385 2390 2395 2400 

Leu Asn Gin Met Asn TVsn Gly Asn Gly Ala Asn Lys Lys Val Glu Leu 
2405 2410 2415 



Ser Arg Met Ser Ser Thr Lys Ser Ser Gly Ser Glu Ser Asp Arg Ser 
55 2420 2425 2430 

Glu Arg Pro Val Leu Val Arg Gin Ser Thr Phe He Lys Glu Ala Pro 
2435 2440 2445 

60 Ser Pro Thr Leu Arg Arg Lys Leu Glu Glu Ser Ala Ser Phe Glu Ser 

2450 2455 2460 



Leu Ser Pro Ser Ser Arg Pro Ala Ser Pro Thr Arg Ser Gin Ala Gin 
2465 2470 2475 2480 

Thr Pro Val Leu Ser Pro Ser Leu Pro Asp Met Ser Leu Ser Thr His 
2485 2490 2495 




wo 98/05347 PCT/US97/12677 

-60- 

Ser Ser Val Gin Ala Gly Gly Trp Arg Lys Leu Pro Pro Asn Leu Ser 
2500 2505 2510 

Pro Thr lie Glu Tyr Asn Asp Gly Arg Pro Ala Lys Arg His Asp lie 
5 2515 2520 2525 

Ala Arg Ser His Ser Glu Ser Pro Ser Arg Leu Pro lie Asn Arg Ser 
2530 2535 2540 

10 Gly Thr Trp Lys Arg Glu His Ser Lys His Ser Ser Ser Leu Pro Arg 

2545 2550 2555 2560 



15 



30 



45 



60 



Val Ser Thr Trp Arg Arg Thr Gly Ser Ser Ser Ser lie Leu Ser Ala 
2565 2570 2575 

Ser Ser Glu Ser Ser Glu Lys Ala Lys Ser Glu Asp Glu Lys His Val 
2580 2585 2590 



Asn Ser lie Ser Gly Thr Lys Gin Ser Lys Glu Asn Gin Val Ser Ala 
20 2595 2600 2605 

Lys Gly Thr Trp Arg Lys lie Lys Glu Asn Glu Phe Ser Pro Thr Asn 
2610 2615 2620 

2 5 Ser Thr Ser Gin Thr Val Ser Ser Gly Ala Thr Asn Gly Ala Glu Ser 

2625 2630 2635 2640 



Lys Thr Leu lie Tyr Gin Met Ala Pro Ala Val Ser Lys Thr Glu Asp 
2645 2650 2655 

Val Trp Val Arg lie Glu Asp Cys Pro lie Asn Asn Pro Arg Ser Gly 

2660 2665 2670 



Arg Ser Pro Thr Gly Asn Thr Pro Pro Val lie Asp Ser Val Ser Glu 
35 2675 2680 2685 

Lys Ala Asn Pro Asn lie Lys Asp Ser Lys Asp Asn Gin Ala Lys Gin 
2690 2695 2700 

4 0 Asn Val Gly Asn Gly Ser Val Pro Met Arg Thr Val Gly Leu Glu Asn 

2705 2710 2715 2720 



Arg Leu Asn Ser Phe lie Gin Val Asp Ala Pro Asp Gin Lys Gly Thr 
2725 2730 2735 

Glu lie Lys Pro Gly Gin Asn Asn Pro Val Pro Val Ser Glu Thr Asn 
2740 2745 2750 



Glu Ser Ser lie Val Glu Arg Thr Pro Phe Ser Ser Ser Ser Ser Ser 
50 2755 2760 2765 

Lys His Ser Ser Pro Ser Gly Thr Val Ala Ala Arg Val Thr Pro Phe 
2770 2775 2780 

55 Asn Tyr Asn Pro Ser Pro Arg Lys Ser Ser Ala Asp Ser Thr Ser Ala 

2785 2790 2795 2800 



Arg Pro Ser Gin lie Pro Thr Pro Val Asn Asn Asn Thr Lys Lys Arg 
2805 2810 2815 

Asp Ser Lys Thr Asp Ser Thr Glu Ser Ser Gly Thr Gin Ser Pro Lys 
2820 2825 2830 



Arg His Ser Gly Ser Tyr Leu Val Thr Ser Val 
65 2835 2840 
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(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 6 5 base pairs 
5 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Other nucleic acid 
(iii) HYPOTHETICAL: NO 
(iv) ANTI- SENSE: NO 
15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 

CGGAATTCNN NNNNNNNAAC AGCNNNNNNN NNAATGAANN NCAAAGTCTG NNNTGAGGAT 6 0 
CCTCA 65 



10 



20 



35 



40 



50 



(2) INFORMATION FOR SEQ ID NO: 32: 



(i) SEQUENCE CHARACTERISTICS: 

2 5 (A) LENGTH; 6 5 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

3 0 (ii) MOLECULE TYPE: other nucleic acid 

(iv) ANTI -SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 
CGGAATTCGA CTCAGAANNN NNNAACTTCA GANNNNNNAT CNNNNNNNNN GTCTGAGGAT 60 
CCTCA 65 



(2) INFORMATION FOR SEQ ID NO: 33: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 65 base pairs 
4 5 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : 1 inear 



(ii) MOLECULE TYPE: other nucleic acid 
(iv) ANTI -SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 
55 CGGAATTCNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNTGAGGAT 60 

CCTCA 65 



